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1. background of the Invention • - 

11 field of the Invention 

The present invention relates generally to fie fields of molecular bmlcgy. 
More particularly, certain embodiments concern nove. nucleic .adid; 
5 gbetically-engi^d recombinant 8-endotox,ns derived J^*^;, 
Mote particular,, it concerns noveTchitneric Crystal proteins and the chimeric „£. 
segments which encode them. Vtuious merhods for making .and using these ^ 
segments, methods of pacing «. encoded proreins, methods* making syntheticaly- 
modified chimeric crystal proteins, and methods of malting and using the synthebc crystal 
,0 proteins are disclosed,,»ch aa, for example, .he use of nucleic acid segments as diagnosuc 
probes and templates for protein production, and fire use of pro*, msion prorenr came. 
„d peptides in various immunological and diagnostic applicafion, Also disclosed is me 
use o, these cry gene fusions and chimeric Cry protein, in .he deve.opmen. of rtnsgemc 
pl an.s which express broad-spectrum insecticidal activity against a variety of coleopteran, 
15 dipteran, and lepidopteran insects. 

1.2 description of Related Art 

121 Bociftos thuringiensis Crystal Proteins 

The Gram-positive soil bacterium Bacillus thuringiensis is well known for us 
20 production of proteinaceous paraspora, crysrals, o, 8-endotoxins, that are toxic to a 
variety of .epidopteran, coleopteran, and dipteran larvae. B. 

CIys tal proteins uuring spomlation which are specifically toxic .to certain spaces of 
insect, Many different strains of B. ft**-* have been shown .0 produce 
insecticidal crystal proteins, and compositions comprising B. thuringiensis strums which 

25 produce protetns having insectictda. activity have been used commercially as 
environmentally-acceptable insecticides because of their toxicity to the specific target 
insect, and non-toxicity to plants and other non-targeted organisms. 

Commercial formulations of naturally occurring B. thuringiensis isolates have 
tog been used for the btological control of agricultural insect pest,_In commerc.a, 

30 production, the spores and crystals obtained from the fomentation process are 
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conceniraied and — ed J 
practices. 



acti ve again, insets of the Order Upidopiera, ,,, caierp.ilar msec*. TJ,«~ 
Diniera 0. flies and mosquitoes) and Coleoptera (U. beetles). A number g 

a„d dipterans. c„3 genes encode coleopteran-to** Cry3 pretax 

encode r r ::: — r l ^ ^- - — 

^^11^^ an.no acid seance ^olo, ^ ,an upon ^ ^ 
specificities. This classification scheme is summarized m TABLE 1 . 
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TABLE 1 
Revised B. thuririgiensis 5-Endotqxin 



Nomenclature 



01* 



CpnRank Accession # 



CtylAb 

Cry 1 Ac . 

Cry 1 Ad 

CrylAe 

CrylBa 

CrylBb 

CrylBc 

CryiCa 

CrylCb 

CrylDa 

CrylDb 

CrylEa 

CrylEb 

CrylFa 

CrylFb 

CrylG 

CrylH 

CrylHb 

Cry2a 

Cry2b 

Cry2Ja 

CrylJb 

CrylK 

Cry2Aa 

Cry2Ab 

Cry2Ac 

Cry3A 

Cry3Ba 

Cry3Bb 

Cry3C 

Cry4A 

Cry4B 

CrySAa 

Cry5Ab 

Cry5B 

Cry6A 

Cry6B 

Cry7Aa 



CrylAta) 

CrylA(b). 

CrylA(c) . 

CrylA(d) 

CrylA(e) 

CryB 

ET5 

PEG5 

CrylC 

CryIC(b) 



PrtB 

CrylE 

CryIE(b) 

CrylF 

PrtD 

PrtA 

PrtC 

CryV 
CryV 
ET4 
ET1 

CryHA 

CryllB 

CryBC 

CryllLA 

CrylDB 

CryIlIB2 

CrylED 

CrylVA 

CrylVB 

CryVA(a) 

CryVA(b) 

Cry VIA 
CryVIB 
CrylBC 



Ml 1250 ; 

M 13898- 

Ml 1068 " ; 

M73250 

M65252 

X06711 

L32020 

Z46442 

X07518 

M97880 

X54160 

Z22511 

X53985 

M73253 

M63897 

Z22512 

Z22510 

Z22513 

U35780 

X62821 

1107642 

L32019 

U31527 

U28801 

M31738 

M23724 

X57252 ~" 

M22472- 

X17123 

M89794 ' 

X59797 

Y00423 

X07423 

L07025 

L07026 
U 19725 
■ L07022 
L07024 
M64478 — 
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Cry7Ab 
Cry8A 



CiylHG 
CryllBF 



Cry 1 OA 
CryllA 
Cry 1 IB 
Cryl2A 
CryBA 
Cryl4A 
CrylSA 
Cryl6A 
CytlA 
Cyt2A 



CrylVC 
CrylVD 
Jeg80 
CryVB 
CryVC 
CryVD 
34kDa 
cbm71 
CytA 
CytB 



GehBank Accession # 

U043o7 
UQ4364 

U04365 .. 'x ; 

U04366 ., 4 . • ' 
X58120- - ..---j 
X75019 : 

Z37527;. -'X ./">•■- 
M12662 
M31737 
X86902 
L07027 
L07023 
U13955 
M76442 
X94146 
X03182 
Z14147 




123 Mode of Crystal Protein Toxicity 

All 5-endotoxin crystals are toxic to insect larvae by ingestion. Solubilization of 
the crystal in the midgut of the .sect releases the protoxin for, of the c-endotoxm 
which in most instances, is subsequently processed to an active toxin by midgut prote- 
t ctivated toxins recognize and bind to the brush-border of the msec m^ 

f in<: Several putative crystal protein receptors have been 
epithelium through receptor proteins. Several putau y 

P , v ■ u* at nl 1995- Gill et al, 199Sr Masson et al, 

isolated from certain insect larvae (Knight et al, 1995, GUI . 

J- f tnxins is followed by intercalation and aggregation of toxm 
1QQ^ The binding of active toxins is ioiiuwcu j _ 

HI ,0 f o« .Mn fc ^ ep— TMS process leads = 
imbal a„ee. swe„ MS , lysis of .he ce,, Hning Che mi d S u, epUheHu. and eve.ua, ,arvae 
mortality. 

1 2 4 Molecular Biology of 8-Endotoxins 

With the advent of molecular genetic techniques, various 8-endotoxin genes have 
be en isolated and then DNA sequences determined. These genes have been used^ to 
construct certain genetically engineered B. ***** P"*~ that have been 
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approved for commercial use. Recent developthents have seen new 8 -er.Jo,oxin dchvery _ 
systems developed, including plants that contain and express genetically engineered: 

S-endotoxin gerves. ^ J >';;■' y- 

The cloning and sequencing of a number of 8 endotoxin genes from a vanety of 
Bacillus . .hurihgtms strains have been described and are summarized by Hofte and " 
WKiteley 1989. Plasmid shuttle vectors designed for the cloning and expressron of 
Endotoxin genes in E. coli or ft are described by Gawron-Burke and 

Baum (1991). U. S. Patent No. 5,441,884 disclose, a site-specilk- rocombinau.H. system 
for constructing recombinant B. thuringiensis strains containing S-endotoxin genes that 
are free of DNA not native to B. thuringiensis. 

The Cryl family of crystal proteins, which are primarily active against 
lepidopteran pests, are the best studied class of 5-endotoxins. The pro-toxin form of Cryl 
5-endotoxins consist of two approximately equal sized segments. The carboxyl-half, or 
pro-toxin segment, is not toxic and is thought to be important for crystal formaUon 
(Arvidson et ai, 1989). The amino-half of the protoxin comprises the acUve-toxm 
segment of the Cryl molecule and may be further divided into three structural domams as 
determined by the recently described crystallographic structure for the active toxxn 
segment of the CrylAa 5-endotoxin (Grochulski et aL, 1995). Domain 1 occupies the 
first third of the active toxin and is essential for channel formation (Thompson et ai, 
1995) Domain 2 and domain 3 occupy the middle and last third of the active toxm, 
respectively. Both domains 2 and 3 have been implicated in receptor binding and insect 
specificity, depending on the insect and 5-endotoxin being examined (Thompson et al, 
1995). 



1.2.5 Chimeric Crystal Proteins 

In recent years, researchers have focused effort on the construction of hybrid 5- 
endotoxins with the hope of producing proteins with enhanced activity or improved 
properties. Advances in the art of molecular genetics over the past decade have 
facilitated a logical and orderly approach to engineering proteinT with improved 
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properties. Si,e-specif,c and r^lon, mntage^ ^ ^MM^ 
11 — s, an, .he development of —a. fo, 

^ ^ne fnsions ann — ing « protein, have Wi.a.ed an a S sc„,en, 
of methods for changing amino acid seances of proteins, fusing porttons of , woe, mo* 
proteins together ma stngle recombinant p.,.,,.,. and -altering genetic fences that 
encode proteins of commercial interest. v ,„ . ., 

Unfortunately, for ctystal proteins, these techniques have only been explcnted « 
United fashion. The liselihood of artrilrarily creadng . ****** «* ^ - 
properties from portions of the numerous nadve prints which have 
11 given the complete nature of protetn suture, foldmg, ohgomen^ « £». 
*d conect processtng of the chimeric proloxin to an active -~<** .. 
selection of stifle target regions within each pro*in, and — n, 
engineering can toxins he synthesized which have improved insecuctdal acUvrty. 

Some suceess in ,he are. however, has heen reported in the 11— - 
exa mp,e, me con— of a few hyhrtd S-endoloxins is reported in the ^owmg . 
related an: Ml. Pat. Appl. Puhl. No. WO 95/30753 disclose* the con— of h bnd 
B. «ns, 5-endo.oxins for production in I*—-*—" - ^ 
non-toxic protoxin fragment of CtylP has heen replaced hy the 
fragment from the Cryl Ac/Cry. Ab .hat is disclosed in U. S. Patent No. 5,128 .30 _ 

segment of Cry.Ae is replaced with the corresponding non-roxic proloxin fragnuen o 
cZ U S. Patent No. 5,055,294 discloses the —on of a specf, nybr. 
slotoxio between Cry.Ac (amino acid residues 1-466, and CrylAb (amino act 

Closes the construclion of a hybrid toxin within the aelive roxin segmen no v-c . 

« presented in regard to the hybrid toxin, insecrictda, activity, mil. Par. App, ™ 

N „ WO 95/30752 discloses the construction of hybrid B. ■,hurin S iens i s 8-endotoxans 

a .™ in P foresees in which the non-toxic protoxin segment of Cr 1C is 
producuon in P. fluorescent afor emen,tioned 
replaced by the non-toxic protoxm segmen. from CtylAb. The.aro 
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appli ca t ion further discos ft* * «* * ^ - " 

5-endo.oxin is improved over that of the parent aedve toxin, CrylC. ,, 
M P« Appl. PuM. No. WO 95/06730 discloses the construction of a hybrid £. 

• s ^...oxin consisting of domams I and J of Cry* '""Pi" 1 < Umuul 3 
thuriniiensis 5-endotoxin conbi!.u»5 ..... • . . : 

t f rrvir Insect bioassays performed against 

and the non-toxic protoxin segment of CrylC. Insect y . . 

• , v t „rrv1CandCryiE),Sp^^^^ 

~" fSenSmVe t0 C ^ C ) how that the hybrid CrylE/CrylC hybrid 

and Mamestra brassicae (sensitive to CrylC) show y . 

■ ■ tM < ex taS exigua, and M tau*«*-- The bioassay results were 
toxin is active against M. sexta, S. engua ^ 

expressed as EC S0 values (toxin concentration giving a 50% growth re ^ > ^ 
J, values (toxin concentration giving 50% mortality). Although the Endotoxins used 
50 • • • ^nlv artificiallv-generated active segments 

for bioassay were produced in B. thunngiensis, only artificially ge 
for bioassay w P natural l y -produced crystals typically produced by 

rii:"^;: — : — — 

that thlfX, v,nes for fine hyhrid ^C- 
f^a were 1.5 ,0 ,.7 fold lower (more acrive) rhan for nartve CrylC * 
JL the _»on of a hyhrid — — 

contains 1 and 2, and CrylC (domain 3 and the non-toxic protoxm segment), aJthoug 
no data are given regarding the hyhrtd toxin, activity or ^ 

Lee « oi. (1995) report the construction of hybnd 8. » 
C „ 1AC md CtylAa within rhe active toxin segment. ArUfteiaUy generated 
between OylAc and Crym inttracdons in 

Ktiv e segments of the hybrid toxms were use o exarm ? ^ 
susceptible insect brush border membranes vestcles (BBMV). T 
hybrid toxins was not reported. 

Honee - I (.99.) report the construction of hybrid S-endotoxtns between 
,H ain 1) and Cty.Ab (domains 2 and 3, and the rectproca, hybrid between Cry.Ab 
(domam 1) and Oy.Ab ( These hybrids failed to show any significant 

(domain 1) and CrylC (domams 2 and 3). These hy 

Lease in activity agatnst suscep,ib,e insects. Furthermore, the CrylC (d» 
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region of Qyltt il,hough no significant increase in activity agair« susceptibie insec, ; ., 
larvae was observed. 

13 Deficiencies iii the Prior Art V.V.' V" • ; 

The tari.ed successes in producing 6— ayxtU proteins which have 
„„ piW ed acUvrry have negatrve.y impacted the fieid Oy —g efforS l0 produce 
recombinandy-engineered crystal protein for comnrereial development, and to extend the 
roxic properties and host specificities of the known endotoxin, Therefore, what rs 
iaddng in Ore prior art are reliable methods and compositions comprising recombmanrly- 
engrneered crystal proteins which have improved insecricida. activity, broad-hosr-range 
spedficit.es. and which are suirdb.e for commercial production in » 

2 Summary of the Invention 

The present invention overcomes these and other limitations in the prror art by 
providing nove. chimeric Endotoxins which have improved insecricidal propert.es, and 

broad-range specificities. 

Disclosed are methods for the construction of B. *ur,n S <ens.s hybnd 
5 .e„d 0 ,ox,ns comprising amino acid sennences from native CrylAc and CrylF crysra! 
proteins. These hybnd proreins, in whrch a., or a portion of CrylAc domain 2, all or a 
pordon of CrylAc domain 3, and all or a porrion of .he CrylAc pro.oxin segment. 
repl aced by rhe corresponding porticos of CrylF, possess no. on.y_rhe inseccr a! 
„is,ics of .he paren, Endotoxins, bu, also have rhe unexpected and remaricable 
properties of enhanced broad-range specificity which is no, proficiently displayed by 
eirher of .he natrve 8-endo.oxins from which .he chrmeric proteins were engrneemd. 

Specifically, .he present invention discloses and claims gene,ically-eng.neered 

hybrid Endotoxins which comprise a portion of a CrylAc crystal protein fused to a 

portion of a CrylF crysra, protein. These chimeric endoroxrns have broad-range 

specificiry for rhe insect pests described herein. 

in a further embodiment, .he present invention also discloses and clarms 

recombinant B. tout*** hybrid Endotoxins which comprise a portion of Cryl Ah, 
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CrylF, and CrylAc in which all or a portion of CrylAb domain 2 or all or a portion pf^ 
Cry 1 Ab domain 3 is replaced by the corresponding portions of Cry lF and all or a portion 
of the CrylAb protoxin segment is replaced by 

Exemplary hybrid ^endotoxins between CrylAb and . CrylF are identified in ; 

SEQ ID NO: 13 and SEQ ID NO: 14. '■'[ t . 

One aspect of the present invention demonstrates the unexpected^ result that 
certain hybrid 6-endotoxins denved from CrylAc and CrylF proteins exhibit not only the 
insecticidal characteristics of the parent Endotoxins, but also possess insecticidal 
activity which is not proficiently displayed by either of the parent 6-endotoxins. 

Another aspect of the invention further demonstrates the unexpected result that 
certain chimeric Cryl Ab/CrylF proteins maintain not only the insecticidal characteristics 
of the parent 8-endotoxins, but also exhibit insecticidal activity which is not displayed by 
either the native Cry 1 Ab or Cry IF endotoxins. 

The present invention also encompasses Cryl Ac/CrylF and CrylAb/CrylF hybrid 
5-endotoxins that maintain the desirable characteristics needed for commercial 
production in B. thuringiensis. Specifically, the hybrid 6-endotoxins identified in 
SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO:26, SEQ ID NO:28, and 
SEQ ID NO:30 can efficiently form proteinaceous parasporal inclusions in B. 
thuringiensis and have the favorable characteristics of solubility, protease susceptibility, 
and insecticidal activity of the parent 5-endotoxins. 

In a further embodiment, the present invention also discloses and claims 
recombinant B. thuringiensis hybrid 5-endotoxins which comprise a portion of CrylAc 
and CrylC in which all or a portion of CrylAc domain 3 is replaced by the corresponding 
portions of CrylC and all or a portion of the CrylAc protoxin segment is replaced by the 
corresponding portion of CrylC. Exemplary hybrid 5-endotoxins between CrylAc and 
CrylC are identified in SEQ ID NO:29 and SEQ ID NO:30. 

One aspect of the present invention demonstrates the unexpected result that, 
although neither CrylAc nor CrylC possess 5. frugiperda activity, the Cryl Ac/Cry 1C 
hybrid 5-endotoxin identified by SEQ ID NO:29 and SEQ ID NO:3T5 has significant 
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activity again, * the rrylAe/Cry.O hybrid End,*.*™ 

identified by SEQ ID NO:29 and SEQ ID NO:30 has signincartiy K,,er «ivity agains: 
S exigua than die OytC parental Endotoxin, , ..;;:;>/'%; ~;:;y;!,ffi|K 
" The present invention fnrrher pertains to the recombinant nne.etc ^sequences, 
which encode the novePerystB proteins diseased herein. SpeeificaUy, mention 
dieses and claims the nucleic acid i*** of SKQtWNO:*. SEQ ID NO:ll, 
SEQID NO:13, SEQ ID NO:25, SEQ ID NO:27, and SEQ ID 
tenons which are comp.emen.ar, to the nucleic acid .sequences o 
SEQ ID NO: 11, SEQIDNO:,3, SEQIDNO:25, SEQIDNO:27, and SEQ IE , NC£ ». 
^ nucleic aeid sequences which hybridize to the sequences of W? NO* 
SEQ ID NO, 1 , SEQ ID NO: .3, SEQIDNO:25, SEQ ID NO.27, and SEQ ID NO: 9. 

The novel hybrid Endotoxins disclosed hetein are useful in the control of a 
br „ad range of tnsec, pests. These hybrid Endotoxins are described 
disced in SEQIDNOtlO, SHQTD N'0:12, SEQ ID NO: 1-1. SEQIDNO.26, 
SEQ ID NO:28, and SEQ ID NO:30. The nuc,eic acid segments encoding *e« pn«n« 
are disc.osed in SEQIDNO:9, SEQIDNO:!., SEQIDNO,3 SEQE3N0.2, 
SEQ ID NO:27, and SEQ ID NO:29. The insecticide, and biochetnica. propemes ^ , the 
hybnd 5-endo.ox.ns are described by FIG. 2, FIG. 3, and TABLE 3, TABLE 4, TABLE 5, 
and TABLE 6. The broad host range of the improved Endotoxins speeffied tn the 
pr esen. invention is usefu. .n ci— ring di.ut.on effects caused by expresstng 
mu ..ip.e Endotoxin genes within a sing,e B. ,torin S Unsis strain. Expression of son 
bro ad host range Endotoxin in plants is expected .0 itnpart protection against a wtder 

variety of insect pests. 

The itnpetos for centering these and othe, hybrid Endotoxins ts to crea. 

nove, toxins with improved tnseedcida, activity, increased bost-tange sp« and 
impr oved paction characteristics. The DNA sconces hsted in TABLE 5 
exchange points for the hybrid Endotoxins pertinent to the presen. inventton and as 
„ lig onuc,eo.,de primers, may be used .0 tdenrify ..Re or simiiar hyhridEndotoxms by 
Southern or co.on, hybridization nnder conditions of moderate to high strmgency. 
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Reseat -tin- i. *■ « ~>— * » ° f "* " 

W^n «wo or mo. Endotoxins can be achieved using a number of in v.v. or in ,,™ 

m „,ecutar genede techniques. SmaU vari^ in ; 
morc 5-endotoxins may yield similar res* or, as da— d for EO, ,062 and 
ESU063, ad^ei, a*, desirable * SinWy, targe various in^^ ; 
regio , between two or more Endotoxins 

demonstrated by EG.. 063 and £0074, or may adverse., affect desirab.e trarrs, as 
demonstrated by EG1 1060 and B01 1063. ■ •■ ■ _ 

favorable .aits with regard ,o improved insecucida, acuvrty, rncreased ho* 
range, and improved producrion cbarac.ris.ics may be achieved, other sucb bybrrd 
Endotoxins induding, but no, limned .0, me cry,, cry2, cry,. cry4, cry 5 , cry6 cry7, 
cryS cryO, crylO, cryU, cry* cry,,, cry,4. cry,5 ^s of 5-endo.oxin genes and rhe B. 

cyol,ic and cy,3 genes. Members of mese off, - 

insecticidal prorerus inc.ude, bu, are not tunned to Cry.Aa, Cry.Ab. Cry. Ad, 

CrylAe Cry.Ba, CrylBb, Cry!Ca, CrylCb. Cry.Da, Cry.Db, Cry.Ea. 

C^Ba, Cry3Bb, Cry3C, Cry4a, Cry4Ba, CrySa, CrySAb, CrydAa Crv6Ba, A a, 
C^Ab CrySAa. CrygBa, CrySCa. Cry9Aa, Cry9Ba, CrySCa, Cry.OAa, Cry 1 1 Aa, 
C^A a, Cry.3Aa, CryHAa, Cry.SAa, CyUAa. and Cyt2Aa. Related ybr. 
aldoroxins would consis, of rbe ammo pordon of one of rbe aforem— 
endotoxins, including ad or par, of domain 1 or domain 2, fused to al. .parrof domarn 
3 from another of me aforementioned 5-endotoxins. Tbe non-acdve pro.oxin fragment of 
such hybrid Endotoxins ma, consis, of ,he protoxin fragment from any of the 
aforemenrroned Endotoxins wh,ch may ac, ,0 stabilise ,he hybrid 8-endo,oxm as 
demo„srra,ed by EG..0S7 and BOU». (see e.g., TABLE 3). Hybrtd 5-endoroxms 
possessing simi.ar traits as those descnbed in ,he presen, invention could be —red 
T y conseLve, or "similar" replacement of amtno acids within hybrtd 5-endoroxrn, 
Such substitutions wou.d mtmic the biochemical and biophystcal proper.es of rbe nartve 
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amino acid a. any position in ,he protein. Amino acids considered similar incWde for . 
example, but are not limited to: _ . 

Ala, Ser, and Thr 

Asp and Glu 

Asn and Gin " 

Lys and Arg 

lie, Leu, Met, and Val 

Phe.Tyr.andTrp . ' ' • - f 

Researchers skilled in the an will recognize that improved msecticdal acuvrty 
.ncreased host range, and unproved prodncdon characteristics imparted upon hybnd 
6 _„dotoxins may be fnrther improved by ajtering tbe genettc code for one or more ammo 
acid positions in the hybrid 8-endotoxin such that the position, or positions, is replaced by 
^ Cher amino acid. This may be accomplished by targeting a region or regions of .he 
protein for mutagenesis by any number of established mutagenic techniques, .ncludmg 
lose procedures reievan, ,0 the present invention. Such techniques incmde 
mut agenesis (Kunlde, .985; Kunkle « aL. 1987), DNA shuffling (S.emmer, 1994), and 
PCR™ overlap extension (Horton e, a,., .989). Since amino acids shunted a. or near the 

surf ace of a protein are UMT «P-* «* * « tth «* " 

non -pro,einaceous moieues, they ma, serve as -target" regions for "— Sue 
surface exposed regions may consist of, bn, not be limited ,o, surface exposed artu o acd 
resid ues within ,he active toxin fragment of the protein and inch.de the inter-a-hehca. or 
inter - p -strand "loop" -regions o, 8-endorox.ns that separate 

a„ d 0-strands within domain 2 and domain 3. Such procedures may favorably change the 
protein, biochemica. and biophystcal characteristics or its mode of action as outhned m 
L Sectton 1. These include, but are no, limited to: 1) improved crysta. formabon, 2) 
im p r „ved protein stability or reduced protease degradation, 3, improved insect membrane 
re cep.or recognition and btnding, 4, improved oligomerization or channel fortnatton m 
lh e insect midgut endothelium, and 5, tmproved tnsecttcida, activity or insecttctda, 
specificity due to any or all of the reasons stated above. 
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2.1 Crystal Protein Transgenes and Transgenic Plants 

In yet another aspect, the present invention provides methods for produping a 
transgenic plant which expresses a nucleic acid segment ^mmg m ^^^ 
crystal proteins of the present invention. The process of producing transgenic plants is 
well-known in the art. In general, the method comprises transforming a suitable host cell 
with a DNA segment which contains a promoter operatively linked to a coding region that 
encodes a B. thuringiensis CrylAc-lF or CrylAb-CrylF, or a Cry 1 Ab- 1 Ac- 1 F chimeric 
crystal protein. Such a coding region is generally operatively linked to a transcription- 
terminating region, whereby the promoter is capable of driving the transcription of the 
coding region in the cell, and hence providing the cell the ability to produce the 
recombinant protein in vivo. Alternatively, in instances where it is desirable to control, 
regulate, or decrease the amount of a particular recombinant crystal protein expressed in a 
particular transgenic cell, the invention also provides for the expression of crystal protein 
antisense mRNA. The use of antisense mRNA as a means of controlling or decreasing 
the amount of a given protein of interest in a cell is well-known in the art. 

Another aspect of the invention comprises a transgenic plant which express a gene 
or gene segment encoding one or more of the novel polypeptide compositions disclosed 
herein. As used herein, the term "transgenic plant" is intended to refer to a plant that has 
incorporated DNA sequences, including but not limited to genes which are perhaps not 
normally present, DNA sequences not normally transcribed into RNA ortranslated into a 
protein ("expressed"), or any other genes or DNA sequences which one desires to 
introduce into the non-transformed plant, such as genes which may normally bcpresent in 
the non-transformed plant but which one desires to either genetically engineer or to have 
altered expression. The construction and expression of synthetic B. thuringiensis genes in 
plants has been described in detail in U. S. Patent No. 5,500,365 and U. S. Patent No. 
5,380,831 (each specifically incorporated herein by reference). 

It is contemplated that in some instances the genome of a transgenic plant of the 
present invention will have been augmented through the stable introduction of one or 
more crylAc-IF, crylAb-IF, or crylAb-lAc-lF transgenes, either native, synthetically- 
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modified, or further mutated. In some instances, more than one transgene will be . 
incorporated into the genome of the transformed host plant cell. Such is the case when 
more than one crystal protein-encoding DNA segment is incorporated into 9?ff^ 
such a plant, in certain situations, it may be desirable to have one, two, three, four, or 
even more Bthuringiensis crystal proteins (either native or recombihintly^ngmeer^d) 
incorporated and stably expressed in the transformed transgenic plant. 

A preferred gene which may be introduced includes, for example, a crystal 
protein-encoding a DNA.sequence from bacterial origin, and particularly one or more of 
those described herein which are obtained from Bacillus spp. Highly preferred nucleic 
acid sequences are those obtained from B. thuringiensis, or any of those sequences winch 
have been genetically engineered to decrease or increase the insecticidal activity of the 
crystal protein in such a transformed host cell. 

Means for transforming a plant cell and the preparation of a transgenic cell line 
are well-known in the art, and are discussed herein. Vectors, plasmids, cosmids, yeast 
artificial chromosomes (YACs) and nucleic acid segments for use in transforming such 
cells will, of course, generally comprise either the operons, genes, or gene-denved 
sequences of the present invention, either native, or synthetically-derived, and particularly 
those encoding the disclosed crystal proteins. These DNA constructs can further mclude 
structures such as promoters, enhancers, polylinkers, or even gene sequences wtuch have 
positively- or negatively-regulating activity upon the particular genes of mterest as- 
desired The DNA segment or gene may encode either a native orjnodified crystal 
protein, which will be expressed in the resultant recombinant cells,_and/or which Will 
impart an improved phenotype to the regenerated plan, Nucleic acid, sequences 
optimized for expression in plants have been disclosed in Intl. Pat. Appl. Publ. No. WO 
93/07278 (specifically incorporated herein by reference). 

Such transgenic plants may be desirable for increasing the insecticidal resistance 
of a monocotyledonous or dicotyledonous plant, by incorporating into such a plant, a~ 
transgenic DNA segment encoding CrylAc-lF and/or CrylAb- IF and/or Cryl Ab-lAc-lF 
crystal protein(s) which possess broad- insects. Particularly preferred plants such as 
grains, including but not limited to com, wheat, oats, rice, maize, and barley; cotton, 
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soybeans and Cher legumes; Tees, including but no, Uri » o«an«Wal,,hrub, Mi. _ 
and nut; vegeub.es, ,urf and pasture grasses, Writs, berries, cirrus, other ciops of 
coramercial interna; including garden and houseplants. ... V;,: .,v:, ; . ■ -y. 

b a related aspect. u« prese»»in^^ 

()rif ,„a, tiansgenic plan,, produced in accordance with ,be above process, Such progeny 
and seeds will bave a s,ab,y crystid pro,ein teansgene s,ab,y incorpora,ed i„,o i.s genome, 
and sucb progeny plan, will inberi, Ore .raits afforded by the intiodocubn f a steble 
^gene in Mendelian fashion. All such <ransge„,c plants having incorporated .»,o then 
genonre transgenic DNA segments encoding one or more chimeric crystal protems or 
polypeptides are aspects of this invention. 

2 2 Crystal Protein Screening and Immunodetection Kits 

The present invention contemplates methods and kits for screening samples 
suspected of containing crystal protein polypeptides or crystal prntein-rdated 
polypeptides, or cel,s producing sucb polypeptides. Said * can contain a nuc.erc actd 
egment or an antibody of the present invention. The kit can contain reagents 
doing an interaction between a sample and a nucleic acid or antibody o, the present 
invention. The provided reagent can be radio-, fluorescently- or enzvmatically-labe ed. 
The kit can contain a known radiolabeled agenr capable of binding or interacung wtth a 
nucleic acid or antibody of die present invention. _ 

Tbe reagent of the kit can be provided as a hquid solution, attached to a sob 
support or as a dried powder. Preferably, when the reagent is provided in a hqmd 
soil, me liquid solution ts an aqueous solution. Preferably, when the reagen 
pmvlded is attached to a so.id support, the solid support can be cbromatograph medta a 
rest plate having a plurality o, wens, or a microscope slide. When tbe reagen, prov, ed ts 
a dry powder, the powder can be reconstituted by the addition of a suitable so.ven,, that 
may be provided. 

' m still further embodiments, the present invention concerns immunode.ec.ton 
m ethods and associated kits. I. ts proposed mat .he crystal proteins or peptides of .he 
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present invention may be employed to detect antibodies having reactivity therewith, or, ^ 
alternatively, antibodies prepared in accordance with the present invention, may be 
employed to detect crystal proteins or crystal protein-related epitope^containing peptides. 
In general, these methods will include first obtaining a sample suspected of cbntaining 
such a protein, peptide or antibody, contacting the sample with an antibody or peptide in 
accordance with the present invention, as the case may be, under conditions effective to 
allow the formation of an immunocomplex, and then detecting the presence of the 
immunocomplex. 

In general, the detection of immunocomplex formation is quite well known in the 
art and may be achieved through the application of numerous approaches. For example, 
the present invention contemplates the application of ELISA, RIA, immunoblot (e.g., dot 
blot) indirect immunofluorescence techniques and the like. Generally, immunocomplex 
formation will be detected through the use of a label, such as a radiolabel or an enzyme 
tag (such as alkaline phosphatase, horseradish peroxidase, or the like). Of course, one 
may find additional advantages through the use of a secondary binding ligand such as a 
second antibody or a biotin/avidin ligand binding arrangement, as is known in the art. 

For assaying purposes, it is proposed that virtually any sample suspected of 
comprising either a crystal protein or peptide or a crystal protein-related peptide or 
antibody sought to be detected, as the case may be, may be employed. It is contemplated 
that such embodiments may have application in the titering of antigen or antibody - 
samples, in the selection of hybridomas, and the like. In related embodiments, the present 
invention contemplates the preparation of kits that may be employed to detect the 
presence of crystal proteins or related peptides and/or antibodies in a sample. Samples 
may include cells, cell supematants, cell suspensions, cell extracts, enzyme fractions, 
protein extracts, or other cell-free compositions suspected of containing crystal proteins 
or peptides. Generally speaking, kits in accordance with the present invention will 
include a suitable crystal protein, peptide or an antibody directed against such a protein or 
peptide, together with an immunodetection reagent and a means for containing the 
antibody or antigen and reagent. The immunodetection reagent will typically comprise a 
label associated with the antibody or antigen, or associated with a secondary binding 
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Ugand. Exempli ligands might include, a secondary ahiibody AM**** die firs, ^ ; 
antibody or antigen or a biotin or avidin (or sueptavrduf, ligand having an 
label. Of course, as noted above, a number Of «emf*« «* g^!*^S^| 
ail such labels may be employed in connects with the ptesent invention. 

The container will generally include a vial into which the antibody, anugen or- 
detection reagent may be placid, and- preferably suitably a«|ol« ******* 
present invention will also typically include a means for containing the antibody, antigen, 
and reagent containers in close confinement for commercial saie. Such containers may 
include injection or blow-molded plastic centers into which tire desired vals are 
retained. 

2 3 ELISAs and Immunoprecipitation 

ELISAs may be used in conjunction with the invention. In an EUSA assay, 
proteins or peptides incorporating crystal protein antigen sequences are immobilized onto 
a selected surface, preferably a surface exhibiting a protein affinity such as tire wells of a 
polystyrene microtia plate. After washing to remove incompletely adsorbed matenal, rt 
is desirable to bind or coa, the assay plate wells with a nonspecific protein that is known 
,o be antigenically neutral with regard to the test antisera such as bovine serum albumin 
(BSA) casein or solutions of milk powder. This allows for blocking of nonspecific 
adsorption sites on the immobilizing surface and thus reduces the background caused by 
nonspecific binding of antisera onto the surface. _ 

After binding of antigenic material to the well, coating with a non-reacUve 
materia! .0 reduce background, and washing to remove unbound material, the 
immobilizing surface is contacted with the antisera or clinica. or biological extract to be 
rested in a manner conducive to immune complex (antigen/antibody) formation. Such 
conditions preferably include diluting the antisera with diluents such as BSA, bovme 
gamma globulin (BOG) and phosphate buffered saline (PBSyiW. These added 
agents also tend .0 assist in the reduction of nonspecific background. The layered 
antisera is then allowed to incubate for from abou. 2 to about 4 hours, at temperatures 
preferably on the order of about 25° to about 27°C. Following incubation, the ant.sera- 
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contacted surface is washed so as Co remove non-itnmunocompleXed material. ^ A ^ 
prefermd washing procedure includes washing with a SO ,u.ion such as PBS^een , Or 

borate buffer. ; : ;h A 

FoUowing formation of specific i.munocomp.exes between *e tes, -, P !e and ; 
,he hound anfigeh, and subsequent washrng, the — ce and even an,. un, o ; 
imm „.e* formation may he determined*, snhiectin, same to a second »: : hcd 
having specificity for the first. To provide a detecting means, the second antibody w.U 
pref ly have an assorted enzyme that wii, generate a co.or deveiopment.pon 
lubating with an appropriate chromogenic substrate. Thus, for exampfe, one w* des.r 
t0 contact and incubate the antisera-bound surface with a urease or ^e-comu^ 

imnttm „c„mp.ex formation <e.g„ incubation for 2 hours a, room temperature m PBS- 

containing solution such as PBS Tween*). 

After incubation with the second enzyme-tagged antibody, and subsequent to 
washi „g ,0 remove unbound materia., the amount of .ahe, is quantified by — 
wit h a chromogenic substrate such as urea and hromocreso, purpie or 2, 2-a.no-dt-P- 
erhyi-benzthiazoUnC-S-suifomc actd (ABTS) and H 2 0 2 , in the ease of p— e a be 
eJyme ,abe,. Quantitation is then achieved by measuring the degree of cotor generation, 
eg using a visible spectra spectrophotometer. 

tl anti-cry,, protein antibodies of the present invention are 
for the isoiation of other crysta, protein antigens by immunoprecip.tahom 
ft^unoprecip.ation invo.ves the separation of the target antigen compone m from 
con.p.ex mixture, and is used to discriminate or isoiate minute amounts of prorem For 
» Lation of membrane proteins ce„s must be —ed into detergent tmce«e • 
N „„,on,c safts are preferred, since other agents such as bt,e sal., pmcipitate at acd P H 

in the presence of bivalent cations. 

!n an alternative embodiment the antibod.es of the present invention are useful for 
the close juxtaposit.cn of two antigens. This is particularly useful for increasing the 
localized concentration of antigens, e.g. enzyme-substrate pairs. 
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2.4 Western Blots 

The compositions of the present invention will find great use in immunoblot or 
western blot analysis. The anti-peptide antibodies may be used as high-affinity primary 
reagents for the identification of proteins immobilized onto a solid support matrix, such 

5 as nitrocellulose, nylon or combinations - thereof! In . conjunction - ^ 
immu^oprecipitation/foUowed by gel electrophoresis, these may^be ^sed as.a smgift step, 
reagent for use in detecting antigens against which secondary reagents used in the 
detection of the antigen cause an adverse background. Tins is especially useful when the 
antigens studied are immunoglobulins (precluding the use of immunoglobulins bmdmg 

10 bacterial cell wall components), the antigens studied cross-react with the detecting agent, 
or they migrate at the same relative molecular weight as a cross-reacting signal. 

Immunologically-based detection methods for use in conjunction with Western 
blotting include enzymatically-, radiolabel-, or fluorescently-tagged secondary antibodies 
against the toxin moiety are considered to be of particular use in this regard. 

15 

2.5 Epitopic Core Sequences 

The present invention is also directed to protein or peptide compositions, free 
from total cells and other peptides, which comprise a purified protein or peptide which 
incorporates an epitope that is immunologically cross-reactive with one or more anu- 
20 crystal protein antibodies. In particular, the invention concerns epitopic core sequences 
derived from Cry proteins or peptides. _ 

As used herein, the term "incorporating an epitope(s) that is immunologically 
cross-reactive with one or more anti-crystal protein antibodies" is intended to refer to a 
peptide or protein antigen which includes a primary, secondary or tertiary structure 
25 similar to an epitope located within a crystal protein or polypeptide. The level of 
similarity will generally be to such a degree that monoclonal or polyclonal antibodies 
directed against the crystal protein or polypeptide will also bind to, react with, or 
otherwise recognize, the cross-reactive peptide or protein antigen. Various immunoassay 
methods may be employed in conjunction with such antibodies, such as, for example, 
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. . „. M ELISA RIA and.heUke.aUofwhioh.ar.taoWn.o.thoseof^Um* _ :. 
Western blotting, bLlbA, wn. * 

^ r <■ ft f Crv immunodominant epitopes, and/or to fWGtional 

The identification of Cry lmmunuu 

■* "noes on .he basis of hydrophUicUy. The memods 
epitopes from aro.no ac,d 

— * — l - wo,. ** - 

t „ identify epi.op.0 corr , se,u -» ( ■ ^ rf ^ ^ C0K 

ofpepadesyn.hesisorreoonrbinan.—y. ^ gwerall , 

Purred peprides for ose in accordance wrth rhe p«sent . 

, f l ...o to ^out 20 amino acids in length, and mote preteraoty 
„e on the order of abou 8 to ah ^ ^ protti ,t- 

potion, and advantageous biodtstributron. ^ ^ 

lt „ proposed .ha, pa.icu.ar advantages of the pmsen, m 

.hrough dre preparatron of syn.he.ic pep.,— - de— _ 
eprtopic/immunogenic core seances whtch r«ul. m ^ ^ 

— - ^ ^.2^^^- hydrophilic region, 
epitopic core se q ue„ces are , den « b« ^ ^ 
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epitop ,c core seance is one .a, Wil, ehci, — * - " 

lies to^H** - peptide compositions or rhe preset «~ > - * 

comply 

toaniho acids or peptides U*** » » foree 

certain epitope core seances of * present invention thay * op*««n* - • 
LTor ,Jt a b ,, ,0 compete wrm or per* displace mending 
protein antigen with the corresponding protein-directed antrsera 

to genera., .he size of the polypeptide antigen is no, belteved .o be pamcnlarly 
crucial so long as i. is a, leas, large enongh .0 carry the identified core seance 

t a tw this size will generally correspond to the smallest 
in m 20 being more preferred. Thus, tnis size win g, 

use in predicting antigenic poruons of proterns (see e.g.. 

„ „ ,988). Comparers peptide se q nence analysts programs (e.g.. DNASUr 
JL ON-ar, Inc.. Madtson, W„ may aiso be usefn, in designing synthertc 

Svntheses of epitopic sequencer, 
withi „ 1 seance, are readi,, achieved nsing —a, *~j£T £ 
as th e sofid phase methoa (e.,, rhrongh Ore use of commerctaliy avarlabe p^ 
sy „ th es,zer such as an Apphed Biosysrems Model 430A Peptide Syn.hesrzer, P P«d 

slore d in conventional manners, such as in a q ueons solutions or. even more preferably, 
a powder or lyophilized state pending use. _ 
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b genera., due ,0 .he re.at.ve stability of peptides, drey may be readily stored m _ 
aqueous so.ur.ons for fairly .ong periods of , .me if desired, ,*.,up .0 si, months » 
1 virtu* any aqueous so.urion wrthou, appreeiab.e degradation or ,oss of ant-genre 
acuity HOwevef, where extended aquedus storage is eon.emp.ared i. wiU generady be 
JL ,0 ,c,de a.enrs „c,udi:, g buffer, sncn * W. o: phosphate buffer, ,0 mar.Ua.rr 
apHo^^abou,,., uM****^*"^^ 

m an aqueous s.a t e r. wiU be desirab.e .0 s.ore .he so.utions a. abou, 4 C. more 
pre ferab,y, frozen. Of course, where One peptides ate stored in a .yoph.ti.eti or powdered 
^! J may be stored vrrtuahy .ndefhnte.y, ,,, in metered atiquots .ha, may he 
Xdr tl wr, a predetermined amount of water (preferab, d,ti..ed, or buffer prror to 



use. 



2 6 Nudeic Aeid Segment Eneoding Crystal Protein Chtoeras _ 

mutagenized, that ean be synthesized, or iso,a«ed from virrually any « 
Zm total genomic DNA and rba, eneode dre nove. chrmerre peptides drse osed he.m- 
DNA segmenrs encoding these peptide spee.es may prove .0 eneode pro^ 

L reld gene produers. h, addr.ion these DNA segments may he synthesrzed entrre.y 
,•„ musing merhods rha. are we.l-tnown ,0 rhose of sh... in the art 

As used herein, rhe » "DNA segment" refers .0 a DNA molecule that has be« 
isol a,ed free of rora, genonnc DMA of a part.cu.ar speo.es. Therefore, a DNA se*»e« 

r ^ o nKFA segment that contains crysiai 
encoding a crystal protein or peptide refers to a DNA segment 

I Lrng Jnees ye. is iso.ated away from, or purrf.ed free from - ^ 
DNA of tite species from which the DNA segment , obtained, wbrch rn the ,ns«n. «. 
> ,be genome of the Gram-positive bacteria, genus, BaciUus, and .» padrcu.ar, . 

1 DNA segmenrs and smader fragmenrs of such segmenrs, and.so reoombman. 



-23- 



A: 5V5S7( iy270H.DOC) 



,,.,,s. including, for examp.e, plasmMs. cosn*. phagemids, phage, v.ruse, and the . 
like • k - ' 

' SimMy, a UNA segment comprising an isolated, or purified ^ protem- 
encoding gene refers Co a DNA segment which may mclude in addifion ,o pep,* 
Iln fences, certam other e.eme„ B such * regu.a.0. ? ,uen ffi — 
IranL ^ from o.Ker natumHy occurring gen. or profe^odi,^ 
b .is respecr, *e rerm "gene" rs used for simp!,., » refer ,o a firneUon^ . 
pn.ypepfide- or pepude-encoding uni, As wU, be understood by those ,u * 
Una, term inciudes both genom, seduces, operon sequences and n^ e 
engineer gene segnreurs .ha, express, or may be adapred ,0 express, prorems, 

^Cl— aiiy away fromo te codin g se q ueuce,me a us th ar fire gene. 
int eresr, in fids case, a gene encoding a oacreriai cys«al protein, forms the srgnrficarupan 
I ,he coding region of the DNA segment, and fira, the DNA segmen, does not centaur 

I,! P 1I oLuraily-occurring coding DNA, such as _ — om = , 
JJL — genes or operon coding regions. Of ««. .rs n*» » * e DNA 

S egmen, as origmaHy isolaled. and does no. exclude genes, m— genes, syntheuc 

^-^^-^- fc T - ^:t^ segments and 
In particular embodiments, the invention concerns rsolated DN 

^omhinlr vectors incoming DNA sequences ma, encode a Cry peptide spec.es h 

rdeswifi.n^noacidsequenceanamiuoacidsequen.easeufi^ 

SEQ ID NO: 10, SEQ ID NO: 12, SEQIDNO:,4, SEQ ID NO:26, SEQ ID N0.28, 

SEQIDNO:30. SEO TDNO10 SEQIDNO:12, 

The term "a sequence essentially as set forth in SEQ ID NO. 10, y 
SEQIDNO:!, SEQ ID NO:26, SEQIDN0.28, or SEQ 0, NO:30" me^.ha. * 
sequence subs.an.ial,y cormsponds .0 a por.,o„ of the sequence of ^ » 
SEQ ID NO" 12. SEQ ID NO: 14, SEQ ID NO:26, SEQ ID NO:28, or SEQ ID NO.30 
1 ILy ew Ino acids lha, are no, identical to, or a bio.ogically foncuona 
eqnivalen, of, the amino acids of any of these sequences. The term « 
flciona, equivalent" is wel, understood in the ar, and rs further define, tn de.a.1 herein 
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„, see Mus.ra.ive Embodiments). Accordingly, seances ,ha. have bet.eS, abo». _ 
70% and about 80%, or more preferably between about 8« and abbu, 90%, br^n 
more preferably between about 9,% and about 99% amino J^.W^W^. 
fnnctiona, equivafehee .o .be a«no acld S of SEQ<DNO:,0. ,SEQIDNO,2. 
SEQID ND-14, SEQ H> NO:26, S EQTDNO:28, „■ SCO ID NO*> wi.l be fences 
(hat are «>ly as * «h > SEQIDNO,0. SEQ.DSO:12, .SEQIDNO:M. 
SEQIDNO:26,SEQIDNO:28,orSEQIDNO : 30.- 

„ wiU also be understood that amino acid and nuoieic acid sequences may mciude 
addiuoaa. residues, such as additional N- or Ctermina! amino acids or 5' or 3< sequences, 
and y e, m beessentially as se, fortb in one of.be sequences so .*«. 

th e seouence meets me crireria se. forth above, including .he mauuenance o, b.oiog.^ 
protei activity where protein expression is conceded. The; addition of : termmaf 
equences particularly applies to nucleic acid sequences that may, for example, mclude 
various non-coding sequences flanKing either of the 5' or 3' portions o, me codmg regmn 
or may include various internal sequences. ,,, inmans. which are known .o occur wrthm . 

8eneS : The nucleic acrd segment of dae presen, invendon, regardless of me ,eng.h of .he 
coding sequence itsem may be combined with o.her DNA sequences, such as promo,ers 
pjenylauon signals, addUional restriction e„,yme sHes, mu.b,e doom, " _ 
Ling segments, and .he U* such that .heir overall lengrh may vary eons.demh * B - 
.herefore con.emp,a.ed .hat a nudeic acid fragmen. of almost any_.eng.h may 
employed, wida the total lengm preferably being limbed by the 

he preparer! that include a short contiguous stretch encoding either of .he pepude 

SEQ H) NO:28, or SEQ ID NO:30, or tha, are rdendcal .0 or «*» - 
sequences which encode any of me peptides disclosed in SEQ EO NO: ,0, SEQ ID NO. 2 
S EQD 3NO,4, SEQIDNO:26. S EQ ID N 0:28. or SEQ K> NO : 30, and part.cu.aa y 
JL DNA segmems dise.oaed in SEQ ID NO:9, SEQ ID NO: 1 1^ SEQ ID NO: 3, 
SEQ ID NO:25, SEQ ID NO:27. or SEQ ID NO:29. For example, DNA sequences such 
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as about 14 nucleotides, and that are up to about 10,000, attmt 5,000, about 3,000, about .. 
2,000, about 1,000, about 500, about 200, about 100, about 50, and about- 14 Base pairs in 
length (including all intermediate lengths) are also contemplated to be^seful. , ; ■ ^ ; : 

It Will be readily understood that "intermediate lengthen tfiese cortte^s, mean 5 - 
any length between the quoted ranges, such as 14,15, l^iflf 
etc- 30,31,32, etc, 50,51,52, 53,^,100, li,ltt«^tl^^ 
including all integers through the 200-500; 500-1,000; 1,000-2,000; 2,000-3,000; 3,000- 
5 000; and up to and including sequences of about 10,000 nucleotides and the like. 

It will also be understood that this invention is not limited to the particular nucleic 
acid sequences which encode peptides of the present invention, or which encode the 
amino acid sequences of SEQIDNO:10, SEQE^O:12, SEQIDNO:14, 
SEQIDNO-26, SEQIDNO:28, or SEQ ID NO:30, including those DN A sequences 
which are particularly disclosed in SEQIDNO:9, SEQIDNO-ll SEQIDNO:13, 
SEQ ID NO:25, SEQ ID NO:27 or SEQ ID NO:29. Recombinant vectors and isolated 
DNA segments may therefore variously include the pepdde-coding regions themselves, 
coding regions bearing selected alterations or modifications in the basic coding region, or 
they may encode larger polypeptides that nevertheless include these peptide-coding 
regions or may encode biologically functional equivalent proteins or peptides that have 

variant amino acids sequences. 

The DNA segments of the present invention encompass biologically-functional, 
equivalent peptides. Such sequences may arise as a consequence of codon redundancy 
and functional equivalency that are known to occur naturally within nucleic acid 
sequences and the proteins thus encoded. Alternatively, functionally-equivalent proteins 
or peptides may be created via the application of recombinant DNA technology, in which 
changes in the protein structure may be engineered, based on considerations of the 
properties of the amino acids being exchanged. Changes designed by man may be 
introduced through the application of site-directed mutagenesis techniques, e.g., to 
introduce improvements to the antigenicity of the protein or to test mutants in order to 
examine activity at the molecular level. 
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If desired, one may also prepare fusion proteins and peptides, e.g., where the ^ 
peptide-coding regions are aligned within thb same expression uhif with other proems (>r 
peptides having desired functions, such as for purifica^-or ,«m|^ee^|^^ 
(e.g., proteins that may be purified by affinity chxp^ <*>* ft V 
regions, respectively). - . *- ■ ' ,V ... 2 

Recombinant vectors form further aspects of the^esentf ve|^ 
useful vectors are contemplated to be those vectors in which the coding portion of the 
DNA segment, whether encoding a full length protein or smaller peptide, is positioned 
under the control of a promoter. The promoter may be in the form of the promoter that is 
naturally associated with a gene encoding peptides of the present invention, as may be 
obtained by isolating the 5' non-coding sequences located upstream of the coding 
segment or exon, for example, using recombinant cloning and/or PCR™ technology, in. 
connection with the compositions disclosed herein. 

2.7 Recombinant Vectors and Protein Expression 

In other embodiments, it is contemplated that certain advantages will be gained by 
positioning the coding DNA segment under the control of a recombinant, or heterologous, 
promoter. As used herein, a recombinant or heterologous promoter is intended to refer to 
a promoter that is not normally associated with a DNA segment encoding a crystal protein 
or peptide in its natural environment. Such promoters may include promoters normally " 
associated with other genes, and/or promoters isolated from any_bacterial, viral, 
eukaryotic, or plant cell. Naturally, it will be important to employ a promoter that 
effectively directs the expression of the DNA segment in the cell type, organism, or even 
animal, chosen for expression. The use of promoter and cell type combinations for 
protein expression is generally known to those of skill in the art of molecular biology, for 
example, see Sambrook et al, 1989. The promoters employed may be constitutive, or 
inducible, and can be used under the appropriate conditions to direct high level- 
expression of the introduced DNA segment, such as is advantageous in the large-scale 
production of recombinant proteins or peptides. Appropriate promoter systems 
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contemplate* for use in high-level expression include, <H*» no. limited .0, the Fickia _ 
expression vector system (Pharmacia LKB Biotechnology). 

b connection with expression embodiments to prepare recombinanr prote.ns a:,d 

peptides, it is contemplated «*, longer 

DNA segments encoding me entire peptide sequence being most preferred. However, t, 
will be appreciated mat the * of «*r DNA .segments to direct the ex*,e|sioh .0 
crystal peptides or epitopic core regions, such as may be u sed to generare antt-crysta, 
protein antibodies, also falls within the scope of the mvention. DNA segments that 
encode peptide antigens from about 8 to about 50 amhto acids in length, or more 
preferably, from about 8 to about 30 amino acids in length, or even more preferably, from 
M 8 to about 20 amino acids in lengm am contemplated to be partiemariy useml. 
Such peptide epitopes may be amino acid sequences which comprise contiguous ammo 
acid sequences from SEQfDNOlO, SEQfDNO:12 SEQ ID NO:14, S EQfX>NO:26 
SEO ID NO-28, or SEQ ID NO:30; or any peptide epitope encoded by the nucletc actd 
sequences of SEQlDNO:9, SEQIDNOtll, SEQIDNO:13, SEQ ID NO:25, 

SEQ ID NO:27, ot SEQ ID NO:29. 

Methods for the recombinant expmssion of crystal proteins and vectors useful m 
the exptession of DNA constructs encoding crystal proteins are described m to,.. Pat. 
Appl. Publ. No. WO 95/02058, specifically incorporated herein by reference. 

2 8 DNA Segments as Hybridization Probes and Primers 

h addition to their use in directing the expression of crystal proteins or pepttdes 
of the present mvention, the nucleic acid sequences contemplated herein also have a 
variety of other uses. For example, they also have utility as probes ot primers in nucletc 
acid hybridization embodiments. As such, it is contemplated that nucleic acid segments 
tha, comprise a sequence tegion that consists of a. least a ,4 nucleotide long contiguous 
sequence that has the same sequence as, or is complement to, a ,4 nucleotide long 

t nf cpnTDNO-9 SEQIDNO:ll, SEQEDNO:13, 
contiguous DNA segment of SEQIDINU.y, ^ 

SEQIDNO-25 SEQ ID NO:27, or SEQ ID NO:29 will find particular utility. Also, 
nucleic acid segments which encode at least a 6 ammo acd contiguous sequence from 
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SRQ ,DNO:10, SKQ1DNOU SEQU1N014. SEQ1D NO:26. SEQ >D NO:28, or _ 
SEQ ffi NO:30, M also prefer. Longer contiguous identical - 
fences. ,,, those of abou, ,0, 30, 40, 50. .00, 200,500, >000 ; ^^0000 
< in c,uding a» tatentieuiatelengmsandupto and inctadtag fu.We W h seoucnces w,U 
also be of use in certain embodiments. - ■; 

The ability of such nncleic acid probes to specifically hybridize to crystal protem- 
encoding seances will enable them to be of use in detecting the presence of 
complementary sciences in a given sample. However, other uses are — . . 
including .he use of the sequence information for «he preparation of mutant spec.es 
. primers, or primers for use in preparing other genetic constructions. 

Nucleic acid molecules having sequence regions consisting of contiguous 
nucleotide stretches of 10-14, .5-20, 30, 50, or even of 100-2* nucleotides or -SO, 
Identical or complementary .o DNA sequences of SEQIDNO:9, SEQIDNO.ll, 
SEQ ID NO: 13, SEQIDNO:25, SEQmNO:27, or SEQ ID NO:29, are partly 
contemplated as hybridization probes for use in, ,,. Southern and Northern blotting. 
Smaller fragment will generally find use in hybridization embodiments, wherern .he 
iength of the contiguous complementary region may be varied, such as he.wee„ abou, 10- 
,4 and abou. 100 or 200 nucleotides, but larger contiguous complemented stretches 
m ay be used, according .0 .he length complementary sequences one wishes to detect. 

Of course, fragments may also be chained by other techniques such as, e.g., by 
ne chanical shearing or by restriction enzyme digestion. Small nucleicacid segments or 
fragments may be readily prepated by, for example, directly synthesizing the fragment by 
chemical means, as is commonly practiced using an automated ohgonucleot.de 
sy „,he,zer. Also, fragments may be obtained by application of nucleic acid reproduce 
technology, such as the PCR™ technology of U. S. Patent Nos. 4,683,195 and 4,683,202 
(each specifically incorporated he,ein by tefemnce), by introducing selected sequences 
i„,„ recombinant vectors for recombinant production, and by other recombinant DNA 
techniques generally known to those of skill in the art of molecular biology. 

Accordingly, the nucleotide sequences of the invention may be used for then 
ability to selectively form duplex molecules with complementary stretches of DNA 
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fragments. Depending on the application envisioned, one will desire to employ varying ^ 
conditions of hybridization to achieve varying degrees of selectivity of probe towards 
target sequence. For applications requiring high selectivity, one wUl typically desire to 
employ relatively stringent conditions to form the hybrids, e.g., one Avill select restively 
low salt and/or high temperature conditions, such as provided by ,abou te 0.02 ; M to about 
0.15 M NaCl at temperatures of about 50°C to about 70°C. Such Elective conditions 
tolerate little, if any, mismatch between the probe and the template or target strand, and 
would be particularly suitable for isolating crystal protein-encoding DNA segments. 
Detection of DNA segments via hybridization is well-known to those of skill in the art, 
and the teachings of U. S. Patent Nos. 4,965,188 and 5,176,995 (each specifically 
incorporated herein by reference) are exemplary of the methods of hybridization analyses. 
Teachings such as those found in the texts of Maloy et al, 1994; Segal 1976; Prbkop, 
1991; and Kuby, 1994, are particularly relevant. 

Of course, for some applications, for example, where one desires to prepare 
mutants employing a mutant primer strand hybridized to an underlying template or where 
one seeks to isolate crystal protein-encoding sequences from related species, functional 
equivalents, or the like, less stringent hybridization conditions will typically be needed in 
order to allow formation of the heteroduplex. In these circumstances, one may desire to 
employ conditions such as about 0.15 M to about 0.9 M salt, at temperatures ranging 
from about 20°C to about 55°C Cross-hybridizing species can thereby be readily 
identified as positively hybridizing signals with respect to control hybridizations. In any 
case, it is generally appreciated that conditions can be rendered.more stringent by the 
addition of increasing amounts of formamide, which serves to destabilize the hybrid 
duplex in the same manner as increased temperature. Thus, hybridization conditions can 
be readily manipulated, and thus will generally be a method of choice depending on the 
desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences 
of the present invention in combination with an appropriate means, such as a label, for 
determining hybridization. A wide variety of appropriate indicator means are known in 
the art, including fluorescent, radioactive, enzymatic or other ligands, such as 
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avidin/biofin, which are capable of giving a detectable signal. I„ pefected embodtments. 
one will likely desire to employ a fluorescent label or an enzyme tag, such as urease, 
alkaline phosphatase or peroxidase, instead of radioactive or other enyironnynu!, . 
nndesirable reagents. In the case of enzyme tags, cdori.ne.ric indicator sulfate, are 
known tha. can be employed .0 provide a means visible „. .he human eye or 
spectrophotomirically, to identify specific hybridization wi,h complementary nucle.c 

acid-containing samples. 

In general, it is envisioned that the hybridization probes described herein will be 
useful both as reagents in solution hybridization as well as in embodiments employing a 
solid phase. In embodiments involving asolid phase, the test DNA (or RNA) is adsorbed 
or otherwise affixed to a selected matrix or surface. This fixed, single-stranded nuclei 
acid is then subjected to specific hybridization with selected probes under desired 
conditions. The selected conditions will depend on the particular circumstances based on 
the particular criteria required (depending, for example, on the G + C content, type of 
target nucleic acid, source of nucleic acid, size of hybridization probe, etc.). Following . 
washing of the hybridized surface so as to remove nonspecifically bound probe 
molecules, specific hybridization is detected, or even quantitated, by means of the label. 

2 9 Biological Functional Equivalents 

Modification and changes may be made in the structure of the peptides of the- 
presen. invention and DNA segments which encode .hem and still obtain a funcUonal 
nrolecule that encodes a protein or peptide with desirable characteristics. The followmg 
is a discussion based upon changing the amino acids of a protein to create an.equ,vale„t, 
or even an improved, second-generation molecule. In particular embodiments of the 
invention, mutated crystal proteins are contemplated to be useful for increasmg the 
insecticidal activity of the protein, and consequently increasmg .he insecticidal acttv.ty 
and/or expression of the recombinant transgene in a plan. cell. The amino acid changetr 
.nay be achieved by changing .he codons of the DNA sequence, according to the codons 
given in TABLE 2. 
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TABIE2 



Amino Acid 



Codoris 



. > ... 



Alanine 


Ala 


A 


GCA 


GCC ••• 


GCG 


GCU 


Gysteine 


Cys 


c 


UGC 


UGU 






Aspartic acid 


Asp 


D 


GAG 


GAU 






Glutamic acid 


Glu 


E 


GAA 


GAG 




- 


Phenylalanine 


Phe 


F 


UUC 


UUU 






Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoleucine 


lie 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


/-ITT/"* 

cue 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


.N 


AAC 


AAU 






Proline 


Pro 


P 


CCA' 


ccx 




ecu 


Glutamine 


Gin 


Q 


CAA 


PAH 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


Serine 


Ser 


s 


AGC 


AGU 


UCA 


UCC 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Trp 


w 


UGG 








Tyrosine 


Tyr 


Y 


UAC 


UAU 







CGG 
UCG 



CGU 
UCU 



For example, certain amino acids may be substituted for other amino acids in a 
protein structure without appreciable loss of interactive binding capacity with structures 
such as, for example, antigen-binding regions of antibodies or binding sites on substrate 
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molecules. Since it is the interactive capacity and nature of a protein that defines that 
protein's biological functional activity, certain amino acid sequence substitutions can be 
made in a protein sequence, and, of course, its underlying DNA coding sequence, and 
nevertheless obtain a protein with like properties. It is thus contemplated by the inventors . 
that various changes may be made in the peptide sequences of the disclosed 
compositions, or corresponding DNA sequences which encode said peptides without 
appreciable loss of their biological utility or activity. 

In making such changes, the hydropathic index of amino acids may be considered. 
The importance of the hydropathic amino acid index in conferring interactive biologic 
function on a protein is generally understood in the art (Kyte and Doolittle, 19-82, 
incorporate herein by reference). It is accepted that the relative hydropathic character of 
the amino acid contributes to the secondary structure of the resultant protein, which in 
turn defines the interaction of the protein with other molecules, for example, enzymes, 
substrates, receptors, DNA, antibodies, antigens, and the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: isoleucine 
(+4.5); valine (44.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); 
methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (^1.5). 

It is known in the art that certain amino acids may be substituted by other amino 
acids having a similar hydropathic index or score and still result in a protein with similar 
biological activity, i.e., still obtain a biological functionally equivalent protein. In making 
such changes, the substitution of amino acids whose hydropathic indices are within ±2 is 
preferred, those which are within ±1 are particularly preferred, and those within ±0.5 are 

even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be 
made effectively on the basis of hydrophilicity. U. S. Patent No. 4,554,101, incorporated 
herein by reference, states that the greatest local average hydrophilicity of a protein, as 
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governed by the hydrophilicity of its adjacent amino acids, correlates with a biological 

property of the protein. 
" As ^tailed in U. S. Patent No. 4,554,101, the followmg ^ydrophilicUy value, 

have been assigned to amino acid residues: argihine ^^0^0^^ 
± 1); glutamate (+3.0 ± 1); serine ( + 0.3); asparagine (+0.2); glutamine (+0,2);gjycme (0); 
threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidihe (-0.5); cystine ^im 
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 
phenylalanine (-2.5); tryptophan (-3.4). 

It is understood that an amino acid can be substituted for another having a similar 
hydrophilicity value and still obtain a biologically equivalent, and in particular, an 
immunologically equivalent protein. In such changes, the substitution of amino acids 
whose hydrophilicity values are within ±2 is preferred, those which are within ±1 are 
particularly preferred, and those within ±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on the 
relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions winch 
take various of the foregoing characteristics into consideration are well known to those of 
skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

2.10 Site-Specific Mutagenesis — 

Site-specific mutagenesis is a technique useful in the preparation of individual 
peptides, or biologically functional equivalent proteins or peptides, through specific 
mutagenesis of the underlying DNA. The technique further provides a ready ability to 
prepare and test sequence variants, for example, incorporating one or more of the 
foregoing considerations, by introducing one or more nucleotide sequence changes into 
the DNA. Site-specific mutagenesis allows the production of mutants through the use of 
specific oligonucleotide sequences which encode the DNA sequence of the desired 
mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer 
sequence of sufficient size and sequence complexity to form a stable duplex on both sides 
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of the deletion junction being traversed. Typically, a primer of about 17 to 25 nucleotide's 
in length is preferred, with about 5 to 10 residues oh both sides of the junction of the 

sequence being altered. 

In general, the technique of site-specific mutagenesis is wdl loiqwri in the att,.as 
exemplified by various publications. As will be appreciated, the technique typically 
employs a phage vector which exists in both a single stranded and double stranded f^ 
Typical vectors useful in site-directed mutagenesis include vectors such as the M13 
phage. These phage are readily commercially available and their use is generally well 
known to those skilled in the art. Double stranded plasmids are also routinely employed 
in site directed mutagenesis which eliminates the step of transferring the gene of interest 

from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two strands of a double stranded 
vector which includes within its sequence a DNA sequence which encodes the desired 
peptide. An oligonucleotide primer bearing the desired mutated sequence is prepared, 
generally synthetically. This primer is then annealed with the single-stranded vector, and 
subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, 
in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is 
formed wherein one strand encodes the original non-mutated sequence and the second 
strand bears the desired mutation. This heteroduplex vector is then used to transform 
appropriate cells, such as E. coli cells, and clones are selected which include recombinant 
vectors bearing the mutated sequence arrangement. . _ 

The preparation of sequence variants of the selected peptide-encoding DNA 
segments using site-directed mutagenesis is provided as a means of producing potentially 
useful species and is not meant to be limiting as there are other ways in which sequence 
variants of peptides and the DNA sequences encoding them may be obtained. For 
example, recombinant vectors encoding the desired peptide sequence may be treated with 
mutagenic agents, such as hydroxylamine, to obtain sequence variants. 

2.11 Crystal Protein Compositions As Insecticides and Methods oTUse 



-35- 

A: 59SS7(I9Z70I!.DOC) 



r- ( 

The invents collate' <ha. the chimeric crystal protein composition, . 
disdosed herein find parricuiar utility * insecticides ■** «#* and/or system* 
apphcanon .0 fie.d crops, grasses, fruits. ** vegetabta, an. ornametUa, bUn|. In a 
purred embodiment, the btoinsechcde composttlop comprises an o„ flowahle 
suspension of ^ria, ceUs which express a noveUtysta, pro,,, disced herem 
FwUii-Va.* 5 tkuringiertis -Celfe,- .Jioweyeii any suc h bacterial bos, 
expressing the novo, nucierc acid segments disclosed herein and peeing a eryaU! 
protein is conren.pia.ed ,0 he useful, such as B. ^.eriun,. B. suM*. * «* 

Pseudomonas spp. ^ m •„_„ „ 

to another import embodiment, the bioinsecticide compos.tton comprtses 

water dispersible granule. This granule comprises bacterial ceUs which expresses a novo, 
erysta! protein disclosed herein. Preferred bacterial cells are B. 
however, bacteria such as B. m e g a«H m B. l coil, or P— spp. cells 

^formed with a DNA segment disclosed herein and expressmg the crystal protetn are 

also contemplated to be useful. 

b a third important embodiment, the bioinsecticide composition comprtses a 
wettab,e powder, dust, psllet, ot coUodial concentrate. This powder comprises bactertal 
cefis which expresses a novel crystal protetn disclosed herein. Preferred actertal ceUs 
are B. ^ — bacteria such as B. B. **** * coU, 

ne^nas spp. ceUs transformed with a DNA segment disclosed heretn an 
expressing the crystal protein are tdso contemplated to be useful. Suchdry forms of the 
insecticida. compositions may be formuiated to disso.ve immediately upon wetttng o 
alternatively, dissolve in a conttofied-release. sustained-te.ease, or other time-dependent 



manner. 



,„ a fourth important embodiment, the bioinsecticide composition comprtses an 
aooeous suspension of bacteria, cells such as those described above which express the 
crystal protein. Such aqueous suspenses may be provided as a concentrated stock 
solution which is diiuted prior to application, or alternative,,, as a di.uted so.ution ready- 
to-apply. 
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For ,be S e methods involving application of oactcial ceils, me 0*.!*- ho gt _ . 
containing Che crysta. protein g ene,s) may be grown in any convenient 
where .he DNA cons™, provide, a selective advantage, providing 
medium ,0 tha, suhsranual.y ah o, ah of the cells retain the It. 0,rin^ gene. the* 
ceUs may then be harvested in accordance with conventional ways. ,JW».*e:y,.me 
cells can be treated prior to harvesting. 

When the msecUcidal compos,.ions comprise intact B. Hurt**- cells 
expressing me protein of interest, such bacteria may be formulated in a variety of ways. 
They may be employed as wettable powders, granules or dusts, by mixang wtth vanous 
inert materials, such as inorganic minerals (phyl.osilica.es, carbona.es su_fates 
phosphates, and .he like) or bo.a„ical materials (powdered corncobs, nee hulls, «lnu, 
sheUs, and the like). The formulations may include spreader-sticker adju ,-an.s, stabt^ng 
agents, omer pesticidal addirives, or surface. Liquid formulations may be aqtieous- 
b ased or non-aqueous and employed as foams, suspensions, emulsive concen.ra.es, or 
to like. The ingredient may include theological agent, surfaces, emulsafiers, . 

dispersants, or polymers. 

Altemarive.y, .he novel chimeric Cry pro.eins may be prepay by recom.nan 
serial expression systems in „<ro and isola.ed for subsequent field app.ica.ion. Such 
protein may be either in crt.de cel. lysates. suspensions, colloids, etc., o. Uternauve,, may 
* purified, refined, buffered, and/or further processed, before formuladng in an actrve - 
b.ocida, formulation. Likewise, under certain circumstances, i. mayj* des.ra.le to 
isol a.e crystals and/or spores from bacteria, cultures expressing the crystal proter. and 
apply solutions/suspensions, or collodia, preparations of such crystals and/or speces as 

the active bioinsecticidal composition. 

Regardless of the method of application, .he amount of .he aett ve com P ooen.(s) 
„ apphed a, an msecticidally-effective amount, which will vary depending on such 
factors as, for example, me specific coleopteran .nsects to be contro.led. the specific plan - 
ar crop to be treated, the environmental conditions, and the method, rate., and quar, aty o 
application of the insecticidally-active composition. ^ 
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The insecticide compositions described may be made by formulating either the . 
bacteria! cell, crystal and/or spore suspension, or tsolateu protein component wrth me 
desired agncnlturally-acceptable carrier. Tbe compositions may be formulated ortor .0 
administration in an appropriate means such as lybphilM, f^e-dried, deseed, ortn 
„ aqueous came, medium or suitable diluent, such a., sgine or other butiet. The 
formula** compositions may be in the form of a dus, or .granular matenal. er a 
suspension in oil (vegetable or mineral), or water or oil/water emulsions, or as a wetmbl, 
powder, or in combination with any other earner material suitable for agrtcuWuml 
[ppfication. Suitable agricultural carriers can be solid or liquid and are we,, brown m me 
a„ The term "agriculturally-acceptable earner" covers all adjuvarUS, e.g., mer, 
components, dispersams, surfactant, .ackifiers, binders, ere. that are ordinarily- used » 
■asecticde formulation technology; these are well known ,0 .hose skilled th msectie.de 
formulation. The formu.a.ions may be m,*ed wim one or more so.id or lion d adjuvant 
and prepared by various means, by homogeneously mixing, blending and/or gntrdmg 
to insecticida. composition with suttable adjuvants using conventional formulae 

lCCbm Z insecricidal compositions of rhrs invention are applied to the environment of 
the target co.eopteran insect, typicajly onto .he foliage of .he plant or crop to be pro.ec.ed 
by conventional methods, preferably by spraying. The strength and durauon of 
oLcticidal application wil, be se, with regard to conditions specific .0 me partrcuUr 
pes.Cs>, crop ( s) to be treated and par.icu.ar environmental conditions, The proper^ 
L of active ingred.ent to carrier w«. naturaHy depend on .the chemtcal nam e, 
sotabilhy, and stability of me insecticidal composition, as wel, as to .partrcular 
formulation contemplated. 

0.her application techniques, e.g., dusting, sprtnkling, soaking, soil mjecfion, seed 
coating, seedltng coating, spraying, aerating, ousting, — g, and .he .ike, are also 
feasible and may be required under certain eireumsranees such as e.g., msec* ma. cause 
roo , or s«alk infesration, or for application .o delicate vege.a,ion or omame„.a, plan., 
These application procedures are also well-known ,0 .hose of skill tn the .art. 
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Th e insecticida, composition o f M^M^M* !» £ * - ' 

lh e — singiy or in —on wtth diner co^ — and no. 
I ! pestictdl The memod of the invention may aUo be in -i— 
« .men* such as sn.ac.an,, de.erge»,s, po, W e, o. .....Cease —tons. 

?, L*. -po^s uf me Pre se, *** 

systemic oi. epical use. ,„„ "hich is used for environmentel. 

The concentration of insecticide! composition whtch .s used 
systemic o. foiiar app.ication wiU vary wideiy depending upon » namre of .he 
/Ill; .'— «Uca.ion, — s. and de^ 

" ida. activi.y. Typical* .he bioinsecticida, composidon win be presen. ,n .he apphed 
— a. a coldadon of a. ,eas. abo, 0.5% by weigh, and may be up ^ 
7 h . •» by weigh. Dry formulations of me compositions may be from 

:r r ; r t« - * - - — • -* ^ 

riadons may general* compr.se from about 0,* .0 abou, 

active ingtedien. by weigh, Formniadons which compnse - bactenaf ceUs . 

generally con* from abou, 10* to abou. 10» ceUs/mg. 

The insec.ic.dal formation may be admmis.ered .o a particular plan, or targe, 
.ea in one or more applications as needed, with a .ypica. Held 
hKW rang ,„g „„ .he order of from abou. 50 g .0 abou. 500 g of active mgmdten, or o 
Hon. 500 g to about ,0=0 g, or of from abou, 1000 g .o about 5000 g or more of- 
active ingredient. — 
212 Chemical and Pharmaceutical Compositions 

" Z inventors coa.emp.ate in addtdon to treating p.ants and .heir en— 
„ ith the iusecticidal compositions of .he invention, .he tieatmeu. of animals an to 

activi.y agatns. inseCs which infest animals and .heir environment. 

The ir.ven.ors particu,ar,y contemplate the use of chimeric crysta! pr„.e.ns m me 
fom u,a,io„ of bioinsectictdes comprtsing specif.c chtmerie 8-endo.oxin pro.eins whtc 
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other ao.es.ic a„ima,s. m —ion of pha—ca, colons Which mayfc _ . 

such infestations. fasecls as described in detail nn U. S. Patent 

Uorated herein by reference, may be P^*«*<W j* . 

I I Soch insects inciude members of the Cenera ^ 

^ — , — . 

Demrnoiria, and DnmoiWf among others. well-known in 

Means for administering insecticidal compositions to an arumal are well 
the an U. S. Patent No. 5,4.6,02 (specific* incorporated herern by ^ 

:■ v«.rinarv-aonrovedformulauonsmaybedelrvereain 
using an insecticidal composition. Vetermary approve 
a variety of methods depending upon the particular application. 

« is further contemplated that in addition to topical adm.nrstrat.on of th 
p _utical composrtions disclosed, systemic administration may in some ^ 
preferable or desirable. For oral administration, the composes may be formulared 
IIH! inen diiuent or wUn an assimUatabie ed,b.e earner, or rbey may be enclosed 
ZZ soft-shel, gelatin capsule, or rhey may be compressed into table., or they may be 
I rporated d.rectly with the food of tine die, For oral therapeutic adm.mstiat.on h 

rablets buccal tables, troches, capsules, eUxirs, suspensions, syrops, wafers, and .he ,*e. 
stch compositions and preparations shou.d contain a, leas, 0,% of active compo^ 
The percentage of the compositions and preparations may, Of course, be vaned and may 
"LenUy I between abou, , ,0 about « of the weigh, of ,he unn The amount 
active compounds in such Iberapeutically usefu, compos.tions is such ,ha, 

d0Mge - a,- Protem may be incorporated wilh ocipien* and 

used ,„ ,he form of a gel, paste, powder, piU, table,, oapsuie, or s,urry which may be g,v 
„ ,he annua, for ingestion. Amative!, the compositions may be fornuda,^ a 
active ,o pe, foods, treats, or olher edib,e —ions. When fonnulamd as a table. 
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. • fhf , fniiowins: a binder, as gum 
capsule or the like, the composition may also contain the following _ 

drsntegrating agent, such as com starch, potato starch, atgrnrc aord and rhe l*e, a 

sacctain may ha added or a flavoring age„, to mate the compost more p*, b e ro 
rlaJrrrg^. Ooe arch means ror M 

sauce as described in U. S. Paten, No. 4,702,9.4 (specflcaUy incorporared here, by 

refere "l en .he dosage unit form is a capsme, i, may contain, in addition to — of 
te above type, a U q »id easier. Various orher materia, may he present - «-» - » 
otewise modify rhe physica, form of the dosage unit. For rnstance, tahiers P* 
capsu.es may he coated with sheilac, sugar or hod, Of course, any materta. sed m 
adng any dosage unit form shomd he pharm— y pure and suhstantraUy non 
LThl — empioyed. m addirron, the active compounds may he rneorporated 
into sustained-release preparauon and formulations. 

natively, the pharmacentieal eomposirions disclosed herem may he 
adhered parenteraUy. inrramuscularly, or even intraperitoneal*. Solouons of th 
1 compounds as free hase or pharmaco.ogica.,y acceptable salrs may be prepared m 

a,so he prepared in glycerol, liquid polyethylene glycols, and mixrures thereof and n 0.1, 

pre ven« the growth of microorganisms. The phan.acen.ica, forms sn.tab.e or mjectaM 
L include steri, a q ueous so.ur.ons or dispersions and sterile powde, for he 
Imporaneous preparation of steri, instable soluhons or dispersions, fn a, eas .he 

* st ah,e under the conditions of manufacture and storage and must he 

the contaminating acuon of microorganisms, such as bacteria and fun,. The - J 

„ a solven, or dispersion medium con.ain.ng, for example, water, ethanol, po.yo (e.g 

mlreof, and/or vegetah.e o.ls. Proper fluid.ty may be maintained, for example, by the 
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of a coating, such as lecithin, by the maintenance of the required particle size in the case 
of dispersion and by the use of surfactants. The prevention of the . action , of 
microorganisms can be brought about by various antibacterial ad antifungal agents, 4or 
Cample, parabens, chlorobutanpl, phenol, sorbic acid, thimerosal, and the like, In many 
5 - cases it will be preferable to include isotonic agents, for example, .ugars or sodium 
chloride. Prolonged absorption of the injectable compositions can be brougnt about by. 
the use in the compositions of agents delaying absorption, for example, aluminum 

monostearate and gelatin. 

When systemic administration is desired, e.g., parenteral administration in an 
10 • aqueous solution, the solution should be suitably buffered if necessary and the liquid 
diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous 
solutions are especially suitable for intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, sterile aqueous media which can be employed will be 
known to those of skill in the art in light of the present disclosure. Some variation in 
15 dosage will necessarily occur depending on the condition, size, and type of animal being 
treated. The person responsible for administration will, in any event, determine the 
' appropriate dose for the individual subject. Moreover, for human administration, 
preparations should meet sterility, pyrogenicity, general safety and purity standards as 
required by FDA Office of Biologies standards. 
20 Sterile injectable solutions are prepared by incorporating the active compounds in 

the required amount in the appropriate solvent with various of the rfher ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
are prepared by incorporating the various sterilized active ingredients into a sterile vehicle 
which contains the basic dispersion medium and the required other ingredients from those 
25 enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuum-drying and freeze-drying 
techniques which yield a powder of the active ingredient plus any additional desired 
ingredient from a previously sterile-filtered solution thereof. 

The compositions disclosed herein may be formulated in a neutral or salt form. 
30 Pharmaceutically-acceptable salts, include the acid addition salts (formed with the free 
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amino groups of the protein) and which are formed with inorganic acids such as, for ,. 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic,' 
tartaric, mandelic, and the like. Salts formed with the free catoyl gr^cay^e. : 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 
histidine, procaine and the like. Upon formulation, logons * ' 

manner compatible with the dosage formulation and in such amount as is therapeutically 
effective. The formulations are easily administered in a variety of dosage forms; such as 
creams, lotions, sprays, dips, emulsions, colloids, or alternatively, when systemic 
administration is desirable, injectable solutions, drug release capsules and the like. 

As used herein, "carrier" includes any and all solvents, dispersion media, vehicles, 
coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying 
agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such 
media and agents for pharmaceutical active substances is well known in the art. Except 
insofar as any conventional media or agent is incompatible with the active ingredient, its 
use in the therapeutic compositions is contemplated. Supplementary active ingredients 
can also be incorporated into the compositions. 

The phrase "pharmaceutically-acceptable" refers to molecular entities and 
compositions that do not produce an allergic or similar untoward reaction when 
administered to a animal. The preparation of an aqueous composition that contains a- 
protein as an active ingredient is well understood in the art. Typically, such compositions 
are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable 
for solution in, or suspension in, liquid prior to injection can also be prepared. The 
preparation can also be emulsified. 

The inventors further contemplate the use of chimeric crystal proteins of the 
present invention as active ingredients in pharmaceutical compositions for administration 
to the living areas and environment of an animal to prevent, lessen, or reduce the" 
infestation of susceptible insects in such an area. The chimeric proteins or a suspension 
of cells which express the particular chimeric protein(s) may be formulated in a powder, 
spray, fog, granule, rinse, shampoo, dip, etc. suitable for administrate to the living 
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garters, bedding materials, or environment of the animal using techniques which are _ 
known to those of skill in the « of veterinary insecticide formulations. An example of 
oral formulation of veterinary insecticides is found in the .eachurgsof tl. S, fate^ »o. 
5,416,102. 

2 13 Antibody Compositions and Methods for Producing 

In particular embodiments, the inventors contemplate the use of antibodies, either 
monoclonal or polyclonal which bind to the crystal proteins disclosed herein. Means for 
preparing and characterizing antibodies are well known in the art (See, e.g., Harlow and 
Lane ,988; incorporated herein by reference). The methods for generating monoelonal 
antibodies (mAbs) generally begin along the same lines as those for preparing po.yelonal 
antibodies. Briefly, a polyclonal antibody is prepared by immunizing an animal wnh an 
immunogenic composition in accordance with the present invention and collectmg 
antisera from that immunized animal. A wide range of animal species can be used for the 
production of antisera. Typically the animal used for production of anti-antrsera rs a 
rabbit, a mouse, a rat, a hamster, a guinea pig or a goat Because of the relatively large 
Hood volume of rabbits; a rabbi, is a preferred choice for production of polyclonal 
antibodies. 

As is well known in the art, a given composition may vary in its immunogenic^. 
It is often necessary therefore to boost the host immune system, as may be achieved by 
coupling a peptide or polypeptide immunogen to a carrier. Exemplary and preferred 
carriers are keyhole limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other 
albumins such as ovalbumin, mouse serum albumin or rabbit serum albumin can also be 
used as carriers. Means for conjugating a polypeptide to a carrier protein are well known 
in the art and include glutaraldehyde, m -maleimidobencoyl-N-hydroxysUccinimide ester, 
carbodiimide and bis-biazotized benzidine. 

As is also well known in the art, the immunogenicity of a particular immunogen 
composition can be enhanced by the use of non-specific stimulators of the immune 
response, known as adjuvants. Exemplary and preferred adjuvants include complete 
Freund's adjuvant (a non-specific stimulator of the immune response containing killed 
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Mycobacterium ,uberculo Si s>, incomplete Freund's adjuvants and aluminum mydroxide _ 
adjuvant. 

The amount of imtnunogen composition used in the production Qf ptolyclonaJ 
antibodies vanes upon the nature of the immunogen as wkl as ,he animal used fo, 
ionization. A varrety of routes can be used to administer ,he imt.-.utn.gen 
(subcutaneous, intramuscular, intradermal, intravenous and mtraperitoaea, ,, The 
production of polyclonal antibodies may be monitored by sampling bloom of the 
immunized animal at vanous points following immunization., A second, booster, 
injection 'may also be given. The process of boosting and titering is repeated untt. a 
suitable titer is achieved. When a desired level of ■mmnnogenicity is obtrumed, the 
immunized animal can be bled and the serum isolated and stored, and/or the anumal can 

be used to generate mAbs. 

mAbs may be readily prepared through use of well-known technique,, such as 
those exemplified in U. S. Patent No. 4,196,265 (specifically incorporated nerein by 
reference). Typically, this technique involves immunizing a suitable anirmul with a 
selected immunogen composition, e.g., a purified or partially purified crystal protem, 
polypeptide or peptide. The immunizing composition is administered in ; manner 
effective to stimulate antibody producing cells. Rodents such, as mice ano^ rats are 
preferred animals, however, the use of rabbit, sheep frog cells is also possible. . ne use of 
rats may provide certain advantages (Coding, 1986, pp. 60-61), but mice are preferred, 
with the BALB/c mouse being most preferred as this is most routinely used and generally 
gives a higher percentage of stable fusions. . . 

Following immunization, somatic cells with the potential for producing 
antibodies, specifically B lymphocytes (B cells), are selected for use in the mAb 
generating protocol. These cells may be obtained from biopsied spleens, tonsils or lymph 
nodes, or from a peripheral blood sample. Spleen cells and peripheral blooa cells are 
preferred, the former because they are a rich source of antibody-producing cells .tnat are in 
the dividing plasmablast stage, and the latter because peripheral blood is easily accessible. 
Often, a panel of animals will have been immunized and the spleen ofanimai with the 
highest antibody titer will be removed and the spleen lymphocytes obtained by 
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... i *u o Tvnirallv a soleen- from an immunized mouse 

homogenizing the spleen with a syringe, lypicany, a s>piccu 

contains approximately 5 x 10 7 to 2 x 10 8 lymphocytes. . ' 

The antibody-producing B lymphocytes from the immunized ariM are ^ 
fused with cells of an immortal myeloma cell, generally one ,of ^ same 
animal that was immunized. Myeloma cell lines suited for us| in hybridom^ducing 
fusion procedures preferably are non-antibody-producing, have high fusion efficiency, 
and enzyme deficiencies that render then incapable of growing in certain selective media - 
which support the growth of only the desired fitted cells (hybridomas). 

Any one of a number of myeloma cells may be used, as are known to those of skill 
in the art (Coding, pp. 65-66, 1986; Campbell, pp. 75-83, 1984). For example, where the 
immunized animal is a mouse, one may use P3-X63/Ag8, X63-Ag8.653, NSl/LAg 4 1, 
S P 210-Agl4, FO, NSO/U, MPC-1 1, MPC1 1-X45-GTG 1.7 and S194/5XX0 Bui; for rats, 
one may use R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210; and U-266, GM1500-GRG2, 
LICR-LON-HMy2 and UC729-6 are all useful in connection with human cell fusions. 

One preferred murine myeloma cell is the NS-1 myeloma cell line (also termed 
P3-NS-l-Ag4-l), which is readily available from the NIGMS Human Genetic Mutant 
Cell Repository by requesting cell line repository number GM3573. Another mouse 
myeloma cell line that may be used is the 8-azaguanine-resistant mouse murine myeloma 

SP2/0 non-producer cell line. 

Methods for generating hybrids of antibody-producing spleen or lymph node cells 
and myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 
ratio, though the ratio may vary from about 20:1 to about 1:1, respectively, in the 
presence of an agent or agents (chemical or electrical) that promote the fusion of cell 
membranes. Fusion methods using Sendai virus have been described (Kohler and 
Milstein, 1975; 1976), and those using polyethylene glycol (PEG), such as 37% (v/v) 
PEG, (Gefter et al, 1977). The use of electrically induced fusion methods is also 
appropriate (Goding, 1986, pp. 71-74). 

Fusion procedures usually produce viable hybrids at low frequencies, about 
1 x 10- 6 to 1 x 10- 8 . However, this does not pose a problem, as the viable, fused hybrids 
are differentiated from the parental, unfused cells (particularly the unfused myeloma cells 
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that would normally continue to divide indefinitely) by eulturing in a selective medium. 
The selective medium is generally one that contains an agent that blocks the de novo 
synthesis of nucleotides in the tissue culture media. Exempi^and prefer^ .agenls,are 
aminopterin, methotrexate, and azaserine. Mck^^0^^^^^ 
synthesis of both purines and pynmidines, whereas azaserine blocks ordf purine 
synthesis. Where aminopterin or methotrexate is used, the media is supplemented with 
hypoxanthine and thymidine as a source of nucleotides (HAT medium). Where azaserine 
is used, the media is" supplemented with hypoxanthine. 

The preferred selection medium is HAT. Only cells capable of operating 
nucleotide salvage pathways are able to survive in HAT medium. The myeloma cells are 
defective in key enzymes of the salvage pathway, e.g., hypoxanthine phosphoribosyl 
transferase (HPRT), and they cannot survive. The B-cells can operate this pathway, but 
they have a limited life span in culture and generally die within about two weeks. 
Therefore, the only cells that can survive in the selective media are those hybrids formed 
15 from myeloma and B-cells. 

This eulturing provides a population of hybridomas from which specific 
hybridomas are selected." Typically, selection of hybridomas is performed by eulturing 
the cells by single-clone dilution in microliter plates, followed by testing the individual 
clonal supernatants (after about two to three weeks) for the desired reactivity. The assay 
should be sensitive, simple and rapid, such as radioimmunoassays, enzyme 
immunoassays, cytotoxicity assays, plaque assays, dot immunobindingjissays, and the 

like. . . 

The selected hybridomas would then be serially diluted and cloned into .individual 
antibody-producing cell lines, which clones can then be propagated indefinitely to 

25 provide mAbs. The cell lines may be exploited for mAb production in two basic ways. A 
sample of the hybridoma can be injected (often into the peritoneal cavity) into a 
histocompatible animal of the type that was used to provide the somatic and myeloma 
cells for the original fusion. The injected animal develops tumors secreting the specific 
monoclonal antibody produced by the fused cell hybrid. The body fluids of the animal, 

30 such as serum or ascites fluid, can then be tapped to provide mAbs in high concentration. 



20 



-47- 

A: 595R7(I9Z?01!.DOC) 



r . L 

The individual cell lines could also be cultured in Vitro, where the mAbs are naturally 
secreted into the culture medium from which they can be readily obtained in high 
concentrations. mAbs produced by either means may be furdi^purified 
filtration, cehtrifUgation and various chromatographic methods such as HPfeC or iffrriity 
5 chromatography. 

3. Brief Description of the Drawings 

The following drawings form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention may be better 
10 understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

FIG. 1. The wild-type 5-endotoxins and the relevant restriction sites that 
were used to construct the hybrid 8-endotoxins pertinent to the invention are diagrammed 
15 in FIG. 1A. Only the DNA encoding the 6-endotOxin that is contained on the indicated 
plasmid (identified by the "pEG" prefix) is shown. The B. thuringiensis strains 
containing the indicated plasmids are identified by the "EG" prefix. The hybrid 
8-endotoxins described in the invention are diagrammed in FIG. IB and are aligned with 

the wild-type 5-endotoxins in FIG 1A. 
20 FIG< 2 . An equal amount of each washed sporulated B. thuringiensis 

culture was analyzed by SDS-PAGE. Lane a: control Cryl Ac producing^, thuringiensis 
strain EG1 1070, b': EG11060,c: EG11062,d: EG11063,e: EG11065,f: EG11067,g: 
EG11071,h: EG11073,i: EG11074J: EG11088,k: EG1 1090, and 1: EG11091. 

FIG. 3. Solubilized hybrid 5-endotoxins were exposed to trypsin for 0, 15, 
25 30, 60, and 120 minutes. The resulting material was analyzed by SDS-PAGE. The 
amount of active 5-endotoxin fragment remaining was quantitated by scanning 
densitometry using a Molecular Dynamics model 300A densitometer. The percent active 
toxin remaining was plotted versus time. Wild-type CrylAc 6-endotoxin (open box) 
served as the control. 
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4. brief Description of the Sequences 

SEQ ID NO: 1 is oligonucleotide primer A. 
5 SEQ ID NO:2 is oligonucleotide primer B. ^ * c/ -V 

SEQ ID NO:3 is oligonucleotide primer C. 

SEQ ID NO:4 is oligonucleotide primer D. 

SEQ ID NO:5 is oligonucleotide primer E. 

SEQ ID NO:6 is oligonucleotide primer F. 
10 SEQ ID NO:7 is oligonucleotide primer G. 

SEQ ID NO:8 is oligonucleotide primer H. 

SEQIDNO:9 is the nucleotide and deduced amino acid sequences of the EG 11063 
hybrid 5-endotoxin. 

SEQ ID NO: 10 denotes the three-letter abbreviation form of the amino acid sequence for 
15 the hybrid 5-endotoxin specified in SEQ ID NO:9. 

SEQ ID NO: 11 is the nucleotide and deduced amino acid sequences of the EG11074 
hybrid 5-endotoxin. 

SEQ ID NO: 12 denotes the three-letter abbreviation form of the amino acid sequence for 
the hybrid 5-endotoxin specified in SEQ ID NO: 1 1 . 
20 SEQ ID NO: 13 is the nucleotide and deduced amino acid sequences of the EG11735 
hybrid 5-endotoxin. — 
SEQ ID NO: 14 denotes the three-letter abbreviation form of the amino'acid sequence for 

the hybrid 5-endotoxin specified in SEQ ID NO: 13. 
SEQ ID NO: 15 is the 5' exchange site for pEG1065, pEG1070, and pEG1074. 
25 SEQ ID NO: 16 is the 5' exchange site for pEG1067, pEG1072, and pEG1076. 
SEQ ID NO: 17 is the 5' exchange site for pEG1068, pEG1077, and pEG365. 
SEQ ID NO: 18 is the 5' exchange site for pEG1088 and pEG1088. 
SEQ ID NO: 19 is the 5' exchange site for pEG1089 and the 3' exchange site for pEG1070 
and pEG 1072. 
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SEQ ID NO:20 is the 5' exchange site for pEG1091. 

SEQ ID NO:21 is the 3' exchange site for pEG1065, pEG1067, pEG1068, and pEG 365. 
SEQ ID NO:22 is the 3' exchange site for pEG1088. 
SEQ ID NO:23 is oligonucleotide Primer I. 

SEQ ID NO:24 is oligonucleotide Primer J. ••';'.< 'V 

SEQ ID NO:25 is the nucleic acid sequence and deduced amino acid sequence of the 

hybrid crystal protein-encoding gene of EG 1 1 092. 
SEQIDNO:26 is the three-letter abbreviation form of the amino acid sequence of the 

hybrid crystal protein produced by strain EGl 1092 encoded by SEQ ID NO:25. 
SEQ ID NO:27 is the nucleic acid sequence and the deduced amino acid sequence of the 

hybrid crystal protein-encoding gene of EGl 175 1. 
SEQ ID NO:28 is the three-letter abbreviation form of the amino acid sequence of the 

hybrid crystal protein produced by strain EGl 175 1 encoded by SEQ ID NO:27. 
SEQ ID NO:29 is the nucleic acid sequence and the deduced amino acid sequence of the 

hybrid crystal protein-encoding gene of EG 1 1 09$. 
SEQ ID NO:30 is the three-letter abbreviation form of the amino acid sequence of the 

hybrid crystal protein produced by strain EGl 109f encoded by SEQ ID NO:29. 

5. Description or Illustrative Embodiments 

5.1 Methods for Culturing B. thuringiensis to Produce the Novel Proteins 

The B. thuringiensis strains described herein may be culturedusing standard 
known media and. fermentation techniques. Upon completion of the fermentation cycle, 
the bacteria may be harvested by first separating the B. thuringiensis spores arid crystals 
from the fermentation broth by means well known in the art. The recovered B. 
thuringiensis spores and crystals can be formulated into a wettable powder, a liquid 
concentrate, granules or other formulations by the addition of surfactants, dispersants, 
inert carriers and other components to facilitate handling and application for particular 
target pests. The formulation and application procedures are all well known in the art and 
are used with commercial strains of B. thuringiensis (HD-1) active against Lepidoptera, 
e.g., caterpillars. 
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5.2 Recombinant Host Cells For Expression of the Novel Chimeric Genes 

The nucleotide sequences of the subject invention can be introduced into a^wide 
variety of microbial hosts. Expression of the toxin gene J^^ 0^^ / 
the intracellular production and maintenance of the pesticide. Wk^ suitable ; . 

Pseudomonas, the Aerobes can be applied to the tS ites,of4epidopterah insects where they 
will proliferate and be ingested by the insects. The results is a control of the unwanted 
insects. Alternatively, the microbe hosting, the toxin gene can be treated under conditions 
that prolong the activity of the toxin produced in the cell. The treated cell then can be 
applied to the environment of target pest(s). The resulting product retains the toxicity of 

the B. thuringiensis toxin. 

Suitable host cells, where the pesticide-containing cells will be treated to prolong 
the activity of the toxin in the cell when the then treated cell is applied to the environment 
of target pest(s), may include either prokaryotes or eukaryotes, normally being limited to 
those cells which do not produce substances toxic to higher organisms, such as mammals. 
However, organisms which produce substances toxic to higher organisms could be used, 
where the toxin is unstable or the level of application sufficiently low as to avoid any 
possibility or toxicity to a mammalian host. As hosts, of particular interest will be the 
prokaryotes and the lower eukaryotes, such as fungi. Hlustrative prokaryotes, both Gram- 
negative and Gram-positive, include Enterobacteriaceae, such as Escherichia, Erwinia, - 
Shigella, Salmonella, and Proteus; Bacillaceae; Rhizobiceae, such_ as Rhizobium; 
Spirillaceae, such as photobacterium, Zymomonas, Serratia, Aeromonas, Vibrio, 
Desulfovibrio, Spirillum; Lactobacillaceae; Pseudomonadaceae, such as Pseudomonas 
and Acetobacter; Azotobacteraceae, Actinomycetales, and Nitrobacteraceae. Among 
eukaryotes are fungi, such as Phycomycetes and Ascomycetes, which includes yeast, such 
as Saccharomyces and Schizosaccharomyces; and Basidiomycetes yeast, such as 
Rhodotorula, Aureobasidium, Sporobolomyces, and the like. 

Characteristics of particular interest in selecting a host cell for purposes of 
production include ease of introducing the B. thuringiensis gene into the host, availability 
of expression systems, efficiency of expression, stability of the pesticiTe in the host, and 
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the presence of auxiliary genetic capabilities. Characteristics of interest for use as a ^ 
pesticide microcapsule include protective qualities for the pesticide, such as thick cell 
walls, pigmentation, and intracellular packaging or formation of inclusion bodies; leaf 
affm it y ; lack of mammalian toxicity; attractiveness to pests for ingestion; ease iof Ming 

5 and fixing without damage to the toxin; and the like. Other considerations include ease 
of formulation and handling, economics, storage stahlity , and the Ike. 

Host organisms of particular interest include yeast, such as Rhodotorula sp., 
Aureobasidium sp., Saccharomyces s P , and Sporobolomyces sp, phylloplane organisms 
such as Pseudomonas sp., Erwinia sp. and Flavobacterium sp.; or such other organisms 

10 as Escherichia, Lactobacillus sp., Bacillus sp., Streptomyces sp., and the like. Specific 
organisms include Pseudomonas aeruginosa, Pseudomonas fluoresces, Saccharomyces 
cerevisiae, Bacillus thuringiensis, Escherichia coli, Bacillus subMlis, Streptomyces 
lividans and the like. 

Treatment of the microbial cell, e.g., a microbe containing the B. thuringiensis 
15 toxin gene, can be by chemical or physical means, or by a combination of chemical and/or 
physical means, so long as the technique does not deleteriously affect the properties of the 
toxin, nor diminish the cellular capability in protecting the toxin. Examples of cheimcal 
reagents are halogenating agents, particularly halogens of atomic no. 17-80. More 
particularly, iodine can be used under mild conditions and for sufficient time to achieve 
20 the desired results. Other suitable techniques include treatment with aldehydes, such as 
formaldehyde and glutaraldehye; anti-infectives, such as zephiran chloride and 
cetylpyridinium chloride; alcohols, such as isopropyl and ethanol; various histologrc 
fixatives, such as Lugol's iodine, Benin's fixative, and Helly's fixatives,, (see e.g., 
Humason, 1967); or a combination of physical (heat) and chemical agents that preserve 
25 and prolong the activity of the toxin produced in the cell when the cell is administered to 
a suitable host. Examples of physical means are short wavelength radiafion such as y- 
radiation and X-radiation, freezing, UV irradiation, lyophilization, and the like. The cells 
employed will usually be intact and be substantially in the proliferative form when 
treated, rather than in a spore form, although in some instances spores may be employed. 
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Where the* ***** ■** * ■">•*■» v^^^^» _ 
mi „o b ia, host, and satd hos, is applied to .he environmen. in a living « »«g 
tha, certain hos, microbes be used.,Mic,oorganismnos,s a. seleclcd whtch are known n 

one or more crops of interest. These «tob^^i^»*^«*^: 

te wild-type microorganisms, provide for stable maintenance and expressron of me gene 
jessing the peptide pesticide, and, destrably, provide for improved prorecrton of 
the pesticide from environmental degradation and maeuvauon. 

A large nnmber of microorgantstns ate known to inhabit the phylloplane (the 
surf ace of me plan, leaves) and/or the rhizosphere (the soi, surronnding plant roots, oT a 
wi de variety of important crops. These microorganisms inc.udo bacten, algae .d 
fung i Of particular interest are microorganrsnrs, such as bacreria, ,g., genera Wta 

« — « — 

M L,e, Azotobacter, U~~~ and fungi, I 

g enera SaccHaro^. Coccus, 

and n. Of par,cu,ar interes. are such phyt.phere bacrenal as 

^ P— *"* ~™ ~ 

^ Ag— n — — ■ X — 

IpcsrW, «**»- ^i, — , and ***** 

phylphere yeast species such as ,,0,0,0™, ruoru, * * * 

and Aureobasidium pollulans. 



5.3 Definitions 

The following words and phrases have the meanings set forth below. 

Broad-Spectrum: refers to a wide range of insect species. 

Broad-Spectrum Insecticidal Activity: toxicity towards a wide ranged insect specs. 
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Expression- The combination of toWdhte process, including transcriotion and • 
translation undergone by a ooding DNA molecub such as a siructura! gene to 
produce a polypeptide. ; / V 

Insecticidal Activity , toxicity towards insects. 
5 Insectitidal Specificity: the toxicity exhibUed by a crystai protein towards muluple 

insect species. 

Intraorder Specificity: the toxicity of a particular crystal protein towards insect speetes 

within an Order of insects (e.g.. Order Lepidoptera). . ... 
Interorder Sspeelflcity: the toxicity of a particular crystal protein towards insect spec.es 
JO of different Orders (e.g.. Orders Upidoptera and Diptera). 

LCs.: the lethal concentration of crystal protein that causes 50% mortahty of the insects 

treated. 

LC, 5 : the lethal concentration of crystal protein that causes 95% mortality of the tnsects 
treated. 

15 Proper: A recognition she on a DNA sequence or group of DNA screes drat 
provide an expression control element for a structural gene and to whtch RNA 
polymerase specifically binds and initiates RNA synthesis (transcription) of that 
gene. 

Regeneration: The process of growing a plan, from a plan, cell (e.g., plant protoplast or 

20 explant). 

Structural Gene: A gene ma, is expressed ,0 produce a polypeptide. _ 
Transformation:. A process of introducing an exogenous DNA sequence (e.g„ a vector, 
a recombinant DNA molecuie) into a cell or protoplast in which ma, exogenous 
DNA is incorporated into a chromosome or is capable of autonomous rephcahon. 
Transformed Cel.: A cell whose DNA has been altered by the introduction of an 

exogenous DNA molecule into that cell. 
Transgene: An exogenous gene which when introduced into the genome of a host cell 
through a process such as transformation, electroporation, particle bombardment, 
and the like, is expressed by the host cell and integrated intone cells genome 
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such that the trait or traits produced by the expression of the transgene is inherited 
by the progeny of the transformed cell. 
Transgenic Cell: Any cell derived or regenerated from a transformed cell or derived 
from a transgenic cell. Exemplary transgenic cells include plant calU derived^ 

5 from a transformed plant cell and particular cells such as leaf, root, stem, eig.,, 

somatic cells, or reproductive (germ) cells obtained from a transgenic plant. 
Transgenic Plant: A plant or progeny thereof derived from a transformed plant cell or 
protoplast, wherein the plant DNA. contains an introduced exogenous DNA 
molecule not originally present in a native, non-transgenic plant of the same 

10 strain. The terms "transgenic plant" and "transformed plant" have sometimes been 

used in the art as synonymous terms to define a plant whose DNA contains an 
exogenous DNA molecule. However, it is thought more scientifically correct to 
refer to a regenerated plant or callus obtained from a transformed plant cell or 
protoplast as being a transgenic plant, and that usage will be followed herein. 

15 Vector: A DNA molecule capable of replication in a host cell and/or to which another 
DNA segment can be operatively linked so as to bring about replication of the 
attached segment. A plasmid is an exemplary vector. 

5.4 Probes And Primers 

20 In another aspect, DNA sequence information provided by the invention allows 

for the preparation of relatively short DNA (or RNA) sequences having the ability to 
specifically hybridize to gene sequences of the selected polynucleotides disclosed herein. 
In these aspects, nucleic acid probes of an appropriate length are prepared based on a 
consideration of a selected crystal protein gene sequence, e.g., a sequence such as that 

25 shown in SEQ ID NO:9, SEQIDNO:ll, SEQIDNO : 13, SEQE>NO:25, 
SEQ ID NO:27, or SEQ ID NO:29. The ability of such nucleic acid probes to specifically 
hybridize to a crystal protein-encoding gene sequence lends them particular utility in a 
variety of embodiments. Most importantly, the probes may be used in a variety of assays 
for detecting the presence of complementary sequences in a given sample. 
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to certain embodiments, it is advantageous to nse oligonucleotide primers. The . - 
sequence of such primers is designed using a polynucleotide of me present invention for 
use in denting, amplifying or mutating a defined segment of a crystal protein gene from 
B. thuringiensis using PCR™ technology. Segments of felted crystal protein ^nes from 
other species may also be amplified by FCR™ using such primers. 

To provide certain of the advantages in accordance with the present invention, a 
preferred nucleic acid sequence employed for hybridizarion stpdies or assays includes 
sequences drat ate complementary to at least a 14 ,0 30 or so long nucleotide stretch of a 
crystal protein-encoding sequence, such as that shown in SEQ ID N0:9, SEQIDNO'.H, 
SEQ ID NO-13, SEQ ID NO:25, SEQ ID NO:27, or SEQ ID NO:29. A sixe of at least 14 
nucleotides in length help* to ensure ma, Ure fmgrueut will be of sufficient length to fam 
a duplex molecule that is bom stable and selective. Molecules havtng complementary 
sequences over sketches greater than 14 bases in length are generally preferred, though, 
in order to increase stability and selectivHy of the hybrid, and thereby improve the qualtty 
and degree of specific hybrid molecules obtained. One will generally prefer to desrgn 
nucleic acid molecules having gene-complementary stretches of 14 to 20 nuc.eotides, or 
even longer where desited. Such fragments may be readily prepared by. for example, 
directly synthesizing the fragment by chemical means, by application of nucletc acrd 
reproduction technology, such as the PGR™ technology of U. S. Paten. Nos. 4,683,195, 
and 4,683,202, each specifically incorporated herein by reference, or by excising selected 
DNA fragments from recombinant plasmids conraining appropriate inserts and suttable 
restriction sites. 

5.5 Expression Vectors 

The present invention contemplates an expression vector comprising a 
polynucleotide of the present invention. Thus, in one embodiment an expression vector is 
an isolated and purified DNA molecule comprising a promoter operatively linked to an 
coding region that encodes a polypeptide of the present invention, which coding regK>n » 
operatively linked to a transcripnon-terminating region, whereby the promoter dnves the 
transcription of the coding region. 
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As used herein, the term -operative., linked" means that a promoter is connected . - 
„ an cod,n g region in sueh a .ay char .he of «* -«f *~ * <»**# 

md r egu,a,ed by rhar promote, Means for opera.ive.y f^:" * c ^ 

region are well known in the art. . . • 

Promo ,ers that function in bac.eria axe wen kuowMh tte art. An exe.up.ary and 
preferred promoter for the **** erysta! proteins include the s ig A, *R ****** 
promoters. Alternatively, the native, mutagenized, or recombinau, crysta, protem- 
encoding.genepromotersthemselvescan.be used, .' 

wle an expression vector of tire present iuveutiou is to be used .0 trausform a 
pla „«, a promos is selected ma. has .he abUity » drive expression in p.an<s. Promoters 
1 function in p.an.s are a!so weti known in .he arr. Usefu. m expressmg A. 
peptide in plants are promoters ,ba. are inducib.e, viral, synthetic, cons,,~ 
deled (Poszkowski e, ai., 1939; Ode.. « aL, .985), and temporally regu.ated, 
spatiallyregula,ed,andspatio-,em P oraUyregu 1 a.ed(Chau«a(., 1989). _ 

A promoter is also se.ec.ed for Us abitity to direct the trausformeri p.an« eeUs or 
U-ausgenic p.anfs transcriptiona, activity to the coding reglou. Structural genes can e 
uriven by a varie,y of promoters In p.au. tissues. Promoters can be near-con— such 
as the CaMV 35S promoter, or tissue-specific or deve.opmeu.aUy specific promorers 

affecting dicots or monocots. 

Where ,he promoter is a near-constitutive promo.er such as CaMV 35S, mcre.es 
„ po.ypep.ide expression are found m a variery of .ransformed p.an. tissues caUus 
J, seed and root,. Alternative*, tire effecrs of transformation can be 
specif, pian. tissues by using p.an, integrating vectors containing a tissue-spec.fic 

Pr0m0l I„ exemplary tissue-specific promorer is the lectin promoter, which is specific for 
seed tissue. The Lectin protein in soybean seeds is encoded by a single gene <UD that rs 
„„,y expressed during seed maturation and accounts for about 2 to about * of rota, see 
u^A. The .ectin gene and seed-specific promo.er have been fiti.y charac.er.ze and 
used .o direc, seed specific express.on in transgenic tobacco plants (Vodkin e, a,., .983, 
Lindstromee a/., 1990.) 
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An expression vector containing a coding region that encodes a polypeptide of . 
interest is engineered to be nnder con.ro! of the .ectin promoter and that vector ,s 
introduced into plants using, for example, a protoplast transforation method .(Mnr e.nl, 
1991). The expression of the polypeptide is directed specifically fo ; the seeds of the 

5 transgenic plant. - , 

A transgenic plant of the present invention produced from a plant cell transformed 
with a tissue specific promoter can be crossed with a second transgenic plan, developed 
from a plant cell transformed with adifferen. tissue specific promoter to prodnce a hybnd 
agonic plant that shows me effects of transformation in more than one specific tissue. 
10 Exemplary tissoe-specific promoters are com sncrose synthetase 1 (Yang er al, 

1990), com alcohol dehydrogenase 1 (Vogel e, al. 1989), com light harvesting complex 
(Simpson, 1986), com hea, shock protein (Odell e, al, 1985), pea small snbnnit RnBP 
carboxylase (Poulsen e, al., 1986; Cashmore e, al., 1983), Ti plasmid mannoptne 
synthase (Langridge e, al, 1989), Ti plasmid nopaline synthase (Langridge e, al, 1989), 
,5 petunia chalcone isomerase (Van Tunen « al, 1988), b«n glycine rich protein 1 (Keller 
e, al 1989), CaMV 35s transcript (Odell e, al, 1985) and Potato patatin (Wenzler e, al, 
m9). Preferred promoters are the cauliflower mosatc virus (CaMV 35S) promoter and 
the S-E9 small subunit RuBP carboxylase promoter. 

The choice of which expression vector and ultimately to which promoter a 
20 polypeptide coding region is operatively linked depends ditectly on the mnctiona. 
properties desired, e.g., the location and timing of protein expression, and the host cell to 
be transformed. .These are well known limitations inherent in the art of construcung 
recombinant DNA molecules. However, a vector useful in practicing the presen. 
invention is capable of directing the expression of the polypeptide coding region to winch 

25 it is operatively linked. 

Typical vectors useful for expression of genes in higher plants are well known ,n 
the art and include vectors derived from the tumor-inducing (Ti) plasmid of 
Agrobacurium lunufaclens described (Rogers e, al, 1987), However, several other plan. 
in.egm.ing vector systems are known to fuuc.ion in planls tncluding pCaMVCN transfer 
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control vector described (Fromm et a/., 1985). Plasmid pCaMVCN (available from _ 
Pharmacia, Piscataway, NJ) includes the cauliflower mosaic virus CaMV 35S promoter. 

In preferred embodiments, the vector used to express the polypeptide includes- a 
selection marker that is effective in a plant cell, preferably a%g i^<* 
5 marker. One preferred drug resistance marker is the gene whose expression results in 
kanamycin resistance; i.e., the chimeric gene containing the nopalhie synthase promoter, 
Tn5 neomycin phosphotransferase II (nptll) and nopaline synthase 3' non-translated 

region described (Rogers et ai, 1988). 

RNA polymerase transcribes a coding DNA sequence through a site where 

10 polyadenylation occurs. Typically, DNA sequences located a few hundred base pairs 
downstream of the polyadenylation site serve to terminate transcription. Those DNA 
sequences are referred to herein as transcription-termination regions. Those regions are 
required for efficient polyadenylation of transcribed messenger RNA (mRNA). 

Means for preparing expression vectors are well known in the art. Expression 

15 (transformation vectors) used to transform plants and methods of making those vectors 
are described in U. S. Patent Nos. 4,971,908, 4,940,835, 4,769,061 and 4,757,011, the 
disclosures of which are each specifically incorporated herein by reference. Those 
vectors can be modified to include a coding sequence in accordance with the present 
invention. 

20 A variety of methods has been developed to operatively link DNA to vectors via 

complementary cohesive termini or blunt ends. For instance,_complementary 
homopolymer tracts can be added to the DNA segment to be inserted and to the vector 
DNA. The vector and DNA segment are then joined by hydrogen bonding between the 
complementary homopolymeric tails to form recombinant DNA molecules. 

25 A coding region that encodes a polypeptide having the ability to confer 

insecticidal activity to a cell is preferably a chimeric B. thuringiensis crystal protein- 
encoding gene. In preferred embodiments, such a polypeptide has the amino acid residue 
sequence of SEQIDNO:10, SEQfDNO.12, SEQIDNO:14, SEQ ID NO:26, 
SEQ ID NO:28, or SEQ ID NO:30; or a functional equivalent of one or more of those 

30 sequences. In accordance with such embodiments, a coding region comprising the DNA 
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sequence of SEQ ID NO:9, SEQ ID NO: 11, SEQIDNQ:13, SEQ IDNO:25, : 
SEQ ID NO:27, or SEQ ID NO:29 is also preferred. 

5.6 Transformed Or Transgenic Plant Cells 

5 A bacterium, a yeast cell, or a plant cell or a plant transformed with ah expression 

vector of the present invention is also contemplated. A transgenic bacterium, yeast cell, 
plant cell or plant derived from such a transformed or transgenic cell is also 
contemplated. Means for transforming bacteria and yeast cells are well known in the art. 
Typically, means of transformation are similar to those well known means used to 

10 transform other bacteria or yeast such as E. coli or Saccharomyces cerevisiae. 

Methods for DNA transformation of plant cells include Agrobacteriur^rmdi^d 
plant transformation, protoplast transformation, gene transfer into pollen, injection into 
reproductive organs, injection into immature embryos and particle bombardment. Each 
of these methods has distinct advantages and disadvantages. Thus, one particular method 

15 of introducing genes into a particular plant strain may not necessarily be the most 
effective for another plant strain, but it is well known which methods are useful for a 

particular plant strain. 

There are many methods for introducing transforming DNA segments into cells, 
but not all are suitable for delivering DNA to plant cells. Suitable methods are believed 
20 to include virtually any method by which DNA can be introduced into a cell, such as by 
Agrobacterium infection, direct delivery of DNA such as, for example, by PEG-mediated 
transformation of protoplasts (Omirulleh et al, 1993), by desiccatipn/inbibition-mediated 
DNA uptake, by electroporation, by agitation with silicon carbide fibers, by acceleration 
of DNA coated particles, etc. In certain embodiments, acceleration methods are preferred 
25 and include, for example, microprojectile bombardment and the like. 

Technology for introduction of DNA into cells is well-known to those of skill in 
the art. Four general methods for delivering a gene into cells have been described: (1) 
chemical methods (Graham and van der Eb, 1973); (2) physical methods such as 
microinjection (Capecchi, 1980), electroporation (Wong and Neumann, 1982; Fromm 
30 etal, 1985) and the gene gun (Johnston and Tang, 1994; Fynan etai, 1993); (3) viral 
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vectors (Clapp, 1993; Lu etai, 1993; Eglitis and Anderson, 1988a; 198.8b); and (4) _ 
receptor-mediated mechanisms (Curiel et al, 1991; 1992; Wagner et «/., 1992). 

5.6.1 Electroporation /;>;.. •:*..>... 

The application of brief, high-voltage electric -pulses' fe a yar|# of *a|iniai;^ J; 
plant cells leads to the formation of nanometer-sized pores in the piasma membrane. 
DNA is taken directly into the cell cytoplasm either through these pores or as a 
consequence of the redistribution of membrane components mat accompanies d^^^ 
the pores. Electroporation can be extremely efficient and can be used both for transient 
expression of clones genes and for establishment of cell lines that carry integrated copies 
of the gene of interest. Electroporation, in contrast to calcium phosphate-mediated 
transection and protoplast fusion, frequently gives rise to cell lines that carry one, or at 
most a few, integrated copies of the foreign DNA. 

The introduction of DNA by means of electroporation, is well-known to those of 
skill in the art. In this method, certain cell wall-degrading enzymes, such as pectin- 
degrading enzymes, are employed to render the target recipient cells more susceptible to 
transformation by electroporation than untreated cells. Alternatively, recipient cells are 
made more susceptible to transformation, by mechanic^ wounding. To effect 
transformation by electroporation one may employ either friable tissues such as a 
suspension culture of cells, or embryogenic callus, or alternatively, one may transform 
immature embryos or other organized tissues directly. One would partially degrade the 
cell walls of the,.chosen cells by exposing them to pectin-degrading enzymes (pectolyases) 
or mechanically wounding in a controlled manner. Such cells would then be recipient to 
DNA transfer by electroporation, which may be carried out at this stage, and transformed 
cells then identified by a suitable selection or screening protocol dependent on the nature 
of the newly incorporated DNA. 

5.6.2 Microprojectile Bombardment 

A further advantageous method for delivering transforming DNA segments to 
plant cells is microprojectile bombardment. In this method, particles mafbe coated with 
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nucleic acids and delivered into cells by a propelling force. Exemplary partieles ; include 
those comprised of tungsten, gold, platinum, and the like. . , : 

An advantage of microprojectile bombardment, m . 
means of reprdducibly stably transforming monocdts, is that hei#r the :isolatioh>of 

5 protoplasts (Cristoii el ai, 1988) nor the suscepibiiity- ^||ff ^^If^f^f 
required. An illustrative embodiment of a method for delivering DNA into maize cells by 
acceleration is a Biolistics Particle Delivery System, which can be used to propel particles 
coated with DNA or cells through a screen, such as a stainless steel or Nytex screen, onto 
a filter surface covered with corn cells cultured in suspension. The screen disperses the 

10 particles so that they are not delivered to the recipient cells in large aggregates. It is 
believed that a screen intervening between the projectile apparatus and the cells to be 
bombarded reduces the size of projectiles aggregate and may. contribute to a higher 
frequency of transformation by reducing damage inflicted on the recipient cells by 

projectiles that are too large. 

15 For the bombardment, cells in suspension are preferably concentrated on filters or 

solid culture medium. Alternatively, immature embryos or other target cells may be 
arranged on solid culture medium. The cells to be bombarded are positioned at an 
appropriate distance below the macroprojectile stopping plate. If desired, one or more 
screens are also positioned between the acceleration device and the cells to be 

20 bombarded. Through the use of techniques set forth herein one may obtain up to 1000 or " 
more foci of cells transiently expressing a marker gene. The number of cells in a focus 
which express the exogenous gene product 48 hours post-bombardment often range from 

1 to 10 and average 1 to 3. 

In bombardment transformation, one may optimize the prebombardment culturing 

25 conditions and the bombardment parameters to yield the maximum numbers of stable 
transformants. Both the physical and biological parameters for bombardment are 
important in this technology. Physical factors are those that involve manipulating the 
DNA/microprojectile precipitate or those that affect the flight and velocity of either the 
macro- or microprojectiles. Biological factors include all steps involved in manipulation 

30 of cells before and immediately after bombardment, the osmotic adjustment of target cells 
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to help alleviate the trauma associated with bombardment, and also the nature; of ithe^ 
transforming DNA, such as linearized DNA or intact supercoiled plasrnicts. It is believed 
that pre-bombardment manipulations are espe^Ml* 
transformation of immature embryos. 

Accordingly, it is contemplated that one may '^^•^i^f^^f^;-.' 
bombardment parameters in small scale studies to fully optimize the conditions: One 
may particularly wish to adjust physical parameters such as gap distance, flight distance, 
tissue distance, and helium pressure. One may also minimize the trauma reduction 
factors (TRFs) by modifying conditions which influence the physiological state of the 
recipient cells and which may therefore influence transformation and integration 
efficiencies. For example, the osmotic state, tissue hydration and the subculture stage or 
cell cycle of the recipient cells may be adjusted for optimum transformation. The 
execution of other routine adjustments will be known to those of skill in the art in light of 

the present disclosure. 

The methods of particle-mediated transformation is well-known to those of skill 
in the art. U. S. Patent No. 5,015,580 (specifically incorporated herein by reference) 
describes the transformation of soybeans using such a technique. 

5.6.3 Agrobacterium-Mediated Transfer 

Agrobacterium-medtiteA transfer is a widely applicable system for introducing 
genes into plant cells because the DNA can be introduced into whole plant tissues, 
thereby bypassing the need for regeneration of an intact plant from a protoplast. The use 
of Agrobacterium-medtezd plant integrating vectors to introduce DNA into plant cells is 
well known in the art. See, for example, the methods described (Fraley et al, 1985; 
Rogers et al., 1987). The genetic engineering of cotton plants using Agrobacterium- 
mediated transfer is described in U. S. Patent No. 5,004,863 (specifically incorporated 
herein by reference), while the transformation of lettuce plants is described in U. S. Patent 
No. 5,349,124 (specifically incorporated herein by reference). Further, the integration of 
the Ti-DNA is a relatively precise process resulting in few rearrangements. The region of 
DNA to be transferred is defined by the border sequences, and intervening DNA is 
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usually inserted into the plant genome as described (Spielmann etaU 1986; Jorgensen 
a/., 1987). 

Modern Agrotemwm transformation vectors are capable ^h^ion in ^E. coli 
as well as Agrobacterium, allowing for convenient 'pg^^^^^^^- 

5 al, 1985). Moreover, recent technological advances in v^t^^^^#^^-/ : ^ 
mediated gene transfer have improved the arrangement of genes and restriction sites ih 
the vectors to facilitate construction of vectors capable of expressing various polypeptide 
coding genes. The vectors described (Rogers et al., 1987), have convenient multi-linker 
regions flanked by a promoter and a polyadenylation site for direct expression of inserted 

10 polypeptide coding genes and are suitable for present purposes. In addition, 
Agrobacterium containing both armed and disarmed Ti genes can be used for the 
transformations. In those plant strains where Agrobacterium-me&iaXtd transformation is 
efficient, it is the method of choice because of the facile and defined nature of the gene 
transfer. 

15 Agrobacterium-medizted transformation of leaf disks and other tissues such as 

cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally 
infects. Agrobacterium-mcdMed transformation is most efficient in dicotyledonous 
plants. Few monocots appear to be natural hosts for Agrobacterium, although transgenic 
plants have been produced in asparagus using Agrobacterium vectors as described 

20 (Bytebier et al., 1987). Therefore, commercially important cereal grains such as rice, 
com, and wheat must usually be transformed using alternative methods. However, as 
mentioned above, the transformation of asparagus using Agrobacterium can also be 
achieved (see, for example, Bytebier et al. , 1 987). 

A transgenic plant formed using Agrobacterium transformation methods typically 

25 contains a single gene on one chromosome. Such transgenic plants can be referred to as 
being heterozygous for the added gene. However, inasmuch as use of the word 
"heterozygous" usually implies the presence of a complementary gene at the same locus 
of the second chromosome of a pair of chromosomes, and there is no such gene in a plant 
containing one added gene as here, it is believed that a more accurate name for such a 



A. 595S7<l9Z7<>i!.DOC) 



-64- 



10 



15 



20 



25 



plant is an independent segregant, because the add,d, exogenous gene segregates 

independently during mitosis and meiosis. 

More preferred is a transgenic plant that is homozygous for the added, structured 

gene; i.e.,, transgenic plant that contains two added genes, one gene at the same locus o:n 
each chromosome of a chromosome pair. A homozygous transgenic plant can, ove 
obtained by sexually mating (selfihg) an independent segregant transgenic plMt fteat 
contains a single added gene, germinating some of the seed produced and analyzing toe 
resulting plants produced for enhanced carboxylase ve^^eto a cdntrol (nativee, 
non-transgenic) or an independent segregant transgenic plant. 

It is to be understood that two different transgenic plants can also be mated ::o 
produce offspring that contain two independently segregating added, exogenous geness, 
Selfing of appropriate progeny can produce plants that are homozygous for both addecc. 
exogenous genes that encode a polypeptide of interest. Back-crossing to a parental plant 
and out-crossing with a non-transgenic plant are also contemplated. 

Transformation of plant protoplasts can be achieved using methods based o?n 
calcium phosphate precipitation, polyethylene glycol treatment, electroporation, amd 
combinations of these treatments (see, e.g., Potrykus et al, 1985; Lorz et al, 198:5; 
Fromm etal, 1985; Uchimiya et al, 1986; Callis et al, 1987; Marcotte et al., 1988). 

Application of these systems to different plant strains depends upon the ability :o 
• regenerate that particular plant strain from protoplasts. Dlustrative methods for toe 
regeneration of cereals from protoplasts are described (Fujimura et al, 1985; Toriyama et 
al, 1986; Yamada et al, 1986; Abdullah et al, 1986). . . . 

To transform plant strains that cannot be successfully regenerated frorm 
protoplasts, other ways to introduce DNA into intact cells or tissues can be utilized. F or 
example, regeneration of cereals from immature embryos or explants can be effected as 
described (Vasil, 1988). In addition, "particle gun" or high-velocity microprojectoe 
technology can be utilized (Vasil, 1992). 

Using that latter technology, DNA is carried through the cell wall and into toe 
cytoplasm on the surface of small metal particles as described (Klein et al, 1987; Klein et 
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al, 1988; McCabe et al, 1988). The metal particles penetrate through several layers : of, 
cells and thus allow the transformation of cells within tissue explants. 

5.7 Production of Insect r Resistant Transgenic Plants 

thus, the amount of a gene coding for a polypeptide of interest (i:e., a bacterial -j 
crystal protein or polypeptide having insecticidal activity against one of more insect 
species) can be increased in plant such as corn by transforming those plants using particle 
bombardment methods (Maddockef al, 1991). By way of example, an expression vector 
containing a coding region for a B. thuringiensis crystal protein and an appropriate 
selectable marker is transformed into a suspension of embryonic maize (corn) cells using 
a particle gun to deliver the DNA coated on microprojectiles. Transgenic plants are 
regenerated from transformed embryonic calli that express the disclosed insecticidal 
crystal proteins. Particle bombardment has been used to successfully transform wheat 
(Vasil etal, 1992). 

DNA can also be introduced into plants by direct DNA transfer into pollen as 
described (Zhou et al, 1983; Hess, 1987; Luo et al, 1988). Expression of polypeptide 
coding genes can be obtained by injection of the DNA into reproductive organs of a plant 
a= described (Pena et al, 1987). DNA can also be injected directly into the cells of 
immature embryos and the rehydration of desiccated embryos as described (Neuhaus et 

al, 1987; Benbrook etal, 1986). 

The development or regeneration of plants from either single plant protoplasts or 
various explants is well known in the art (Weissbach and Weissbach, 1988). This 
regeneration and growth process typically includes the steps of selection of transformed 
cells, culturing those individualized cells through the usual stages of embryonic 
development through the rooted plantlet stage. Transgenic embryos and seeds are 
similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an 
appropriate plant growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous gene 
that encodes a polypeptide of interest introduced by Agrobacterium from leaf explants 
can be achieved by methods well known in the art such as described (Horsch et al, 1985). 
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In this procedure, transformants are cultured m^ vr^^^^^^^%_ 
medium that induces the regeneration of shoots in the plant si|ih being tr^sfdrmed k 
described (traley ai, 1983) In particular, U. S. Patent %.J,349, ; 1 24- details the 
cioh of^eneticaily transformed lettdce andTplants resulting therefrom which 
express hybrid crystal proteins confer^ insaaicidal activity-a^bepido^eranUarvae 
to such plants, ! • ' • 'V . v-,,,-- ,■ 1 r-§r-> V .-. 

This procedure typically produces shoots within two to four months and those 
shoots are then transferred to an appropriate root-inducing medium containing the 
selective agent and an antibiotic to prevent bacterial growth. Shoots that rooted in the 
presence of the selective agent to form plantlets are then transplanted to soil or other 
media to allow the production of roots. These procedures vary depending upon the 
particular plant strain employed, such variations being well known in the art. 

Preferably, the regenerated plants are self-pollinated to provide homozygous 
transgenic plants, as discussed before. Otherwise, pollen obtained from the regenerated 
plants is crossed to seed-grown plants of agronomical^ important, preferably inbred 
lines. Conversely, pollen from plants of those important lines is used to pollinate 
regenerated plants. A transgenic plant of the present invention containing a desired 
polypeptide is cultivated using methods well known to one skilled in the art. 

A transgenic plant of this invention thus has an increased amount of a coding 
region (e.g., a cry gene) that encodes the Cry polypeptide of interest. A preferred 
transgenic plant is an independent segregant and can transmit that genejmd its activity to 
its progeny. A more preferred transgenic plant is homozygous for that gene, and 
transmits that gene to all of its offspring on sexual mating. Seed from a transgenic plant 
may be grown in the field or greenhouse, and resulting sexually mature transgenic plants 
are self-pollinated to generate true breeding plants. The progeny from these plants 
become true breeding lines that are evaluated for, by way of example, increased 
insecticidal capacity against Coleopteran insects, preferably in the field, under a range of 
environmental conditions. The inventors contemplate that the present invention will find 
particular utility in the creation of transgenic corn, wheat, oats, barley, other grains, 
vegetables, fruits, fruit trees, berries, turf grass, ornamentals, shrubs andTrees. 



-67- 



A: 59587O9Z70I! DOC) 



6. Examples 

The following examples are included to demonstrate preferred embodiments of 
the invention. It should be appreciated by those &^^]^&Wffi- 
disclosed in the examples which follow represent techniques discovered by -the inventor 
to function well in the practice of the invention, arid thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

6.1 Example 1 Construction of Hybrid B. thuringiensis 5-Endotoxins 

The B. thuringiensis shuttle vectors P EG853, P EG854, and pEG857 which are 
used in the present invention are described by Baum et al, 1990. The plasmid pEG857 
contains the CrylAc gene cloned into P EG853 as an Sphl-BamHl DNA fragment. 
pEG1064 was constructed for purposes of the present invention in such a way that the 
Kpnl site within the CrylAc gene was preserved and the Kpnl site in the P EG857 multiple 
cloning site (MCS) was eliminated. This was accomplished by sequentially subjecting 
P EG857 DNA to limited Kpnl digestion so that only one Kpnl site is cut, filling in the 
Kpnl 5' overhang by Klenow fragment of DNA polymerase I to create blunt DNA ends, 
and joining the blunt ends of DNA by T4 DNA ligase. P EG318 contains_the CrylF gene 
(Chambers et aU 1991) cloned into the Xhol site of pEG854.as a Xhol-SaH DNA 
fragment. P EG315 contains the CrylC gene from strain EG6346 (Chambers efal, 1991) 
cloned into the Xhol-BamUl sites of pEG854 as a SaH-Bamm DNA fragment. 

PIG. 1A shows a schematic representation of the DNA encoding the complete 
CrylAc, CrylAb, CrylC, and CrylF genes contained on P EG854/pEG1064, P EG20, 
P EG315, and pEG318, respectively. Unique restriction sites that were used in 
constructing certain hybrid genes are also shown. FIG. IB shows a schematic 
representation of hybrid genes pertaining to the present invention. In some cases standard 
PCR™ amplification with mutagenic oligonucleotide primers were used to incorporate 
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appropriate restrictions sites into. DNA fragments used for hybrid, gene construction.^ 
Certain hybrid gene constructions could hot be accomplished by restriction fragment 



5 (purchased from Integrated DNA Technologies Inc., Coralville, IA) weremsed:;. ; 
Primer A 

5'-GGATAGCACTCATCAAAGGTACC-3' (SEQ ID NO:l) 
Primer© 

S^GAAGATATCCAATTCGAAG AGTTTCCC-3' (SEQ ID NO:2) 

10 Primer C 

S'-GATATTCTGCCTCGAGTGTTGCAGTAAC-S' (SEQ ID NO:3) 

Primer D 

5'-CCCGATCGGCCGCATGC-3' (SEQ IDNO:4) 
Primer E 

15 5'-CATTGGAGOTGTCCATG 3' (SEQ ID NO:5) 
Primer F 

5'-GCACTACGATGTATCC-3' (SEQ ID NO:6) 
Primer G 

5'-CATCGTAGTGCAACTCTTAC-3' (SEQ ID NO:7) 

20 Primer H 

5'-CCAAGAAAATACTAGAGCTCTTGTTAAAAAAGGTGTTCC-3' (SEQ ID NO:8) 

Primer I 

5'-ATTTGAGTAATACTATCC-3' (SEQ ID NO:23) 
Primer J 

25 5'-ATTACTCAAATACCATTGG-3' (SEQ ID NO:24) 

The plasmids described in FIG. IB containing the hybrid 5-endotoxin genes 
pertinent to this invention are described below. Isolation or purification of DNA 
fragments generated by restriction of plasmid DNA, PCR™ amplification, or POE refers 
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,„ th e sequential application of agarose-TAE ge. e,ec.rqpho # s an,.u*of n.CW.ean 
Kit (Bio .0,) following to manufacturer's recommendation pEG.065 was consumed 
by PCR™ amplification of the Crvfr-' DNA fragment using prune, pair A and B and 
p BG3>8 as ,he DNA ,emp.a K . The tesuhtng PCR™ product was isolated, cu, with ,U,d, 
and K„nV and used to tepiace the correspondtug Ar.t, A>i IWA ^ * 
mm^mr** constructed using WE and DNA fragments Saul-Kpnlot CrylF 
and Ar»n- CW of C*Mc tha, were isoiaKd from P EG3 18 and pBG8 5 7, respecrvely 
The resuUing POE product was PGR- ampHf^wuh primer pair A and B, 
M and K P nl, and used to tepiace the corresponding As»n-Kp»I fragment in p E G85Z 

PEG1068 was consented by replacing the Social DNA fragment of CrylAc 
.seated from P EG857 with the correspond^ **** DNA fragment 
Cry* (PEG3.8,. pEG.070 was constructed by rep.acing *e Sa&4£pn} DNA fragment 
isol a,ed from pEG.065 with the corresponding **** DNA fragment Mated from 
CrylAc <pEG857, pEGiOn was cot— by replacurg the **** DNA ragm- 
Jated from pEG>067 with the corresponding **** DNA fragment tsolated fr cm 
C„Mc (PEG857). P EG1074. pEG1076, and pEG1077 were constructed by repiacmg Are 
DNA fragment from pEG.064 wi, the PCR™ amplified »HM DNA 
fragmen, from pEG.065, pEGi067, pEG.068, respecrivefy, using primer pa* Cauda 
pEG10 89 was constructed by replacing the SpHl-Sacl DNA fragment of P EG1064 w«h 
th e isolated and S„M and « cut PCR™ product of CrylF that wa^generated ustng 
primer pair D and E and the template pEG318. _ 

PEG 1091' was consumed by replacing the SpHl-Sacl DNA fragment of pEG1064 
with the iso.a.ed and S P Hl and Sad cut PCR™ product of CrylC tha, was-generated ustng 
primer pair D and H and the template pEG3 1 5 

p EO1 088 was constructed by POE using a CrylAc DNA fragmen. generated 
asing pnmer pair B and F and a CryiCDNA fragment generated ustng primer parr A and 
G. The So*** fragnren. was iS o.a.ed from the tesu.ring POE product and used to 
repl ace the corresponding Sacl-Kpnl fragment in pEG1064. 

PEG365 was cons.ruc.ed by first replacing the SpHl-Kpnl DNA fragment from 
pEG106 5 with .he copending CryMo DNA fragment tsola.ed from pEG20 .o g,ve 
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pEG364. The Sacl-Kpnl DNA fragment from pEG364 was then .replaced with the 
corresponding CrylF DNA fragment isolated from pEG31&. 

pE G1092 was constructed by replacing the Kpiil-Barnill DNA 
pEG 1088 with ;the corresponding^ DNA ; f^|n^^i^ia^*^i^^^^- ^^^^^ 
distinct from the crylAb/crylC hybrid 5-endotoxin gene disclosed in Intl. Pat ^t*g 
No. WO 95/06730. 

pEG1093 was constructed by replacing the Sphl-AsuE DNA fragment from 
pEG1068 with the corresponding SphhAsuU DNA fragment isolated from pEG20. 

pEG378 was constructed by POE using a crylAc DNA fragment generated using 
primer pair B and I using pEG857 as the template and a cry IF DNA fragment generated 
using primer pair A and J using pEG318 as the template. The resulting POE product was 
cut with AsiM and Kpnl and the resulting isolated DNA fragment used to replace the 
corresponding Asull-Kpnl DNA fragment in pEG1064. 

6.2 Example 2 - Production of the Hybrid Toxins In B. thuringiensis 

The plasmids encoding the hybrid toxins described in Example 1 were 
transformed into B. thuringiensis as described by Mettus and Macaluso, 1990. The 
resulting B. thuringiensis strains were grown in 50 ml of C-2 medium until the culture 
was fully sporulated and lysed (approximately 48 hr.). Since crystal formation is a 
prerequisite for efficient commercial production of 5-endotoxins in B. thuringiensis, 
microscopic analysis was used to identify crystals in the sporulated cultures (TABLE 3). 
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Strain 



TABLE 3 

Crystal Formation by the Hybrid 8-Endotoxhw 
Plasmid 



NRRL a NRRL 
Accession Deposit Date 



Parent 8-Endotoxins 



Crystal 
Formation 



b(jrl lUoU 




P EG1065 


CrylAc + CrylF 


EGllUoZ 




P EG1067 


Cryl Ac + CrylF 


EG11063 


t>-Z 1 J / 7 


May 24, 1996 pEG1068 


CrylAc + CrylF 


EG11065 


•- 


P EG1070 


Cryl Ac + CrylF 


EG11067 




pEG1072 


CrylAc + CrylF 


EGH071 




pEG1074 


Cryl Ac + CrylF 


EGllO/3 




pEG1076 


Cryl Ac + CrylF 


EGH074 


B-21580 


May 24, 1996 pEG1077 


CrylAc + CrylF 


EGH087 




pEG1088 


CrylAc + CrylC 


EGH088 




pEG1089 


CrylF + Cryl Ac 


EGH090 




pEG1091 


CrylC + CrylAc 


EGH091 




pEG1092 


CrylAc + CrylC 


EGH092 


B-21635 


Oct. 21, 1996 pEG1093 


CrylAb + CrylAc + CrylF 


EGH735 


B-21581 


May 24,1996 pEG365 


CrylAb + CrylF + Cryl Ac 


EGU751 


B-21636 


Oct. 21, 1996 pEG378 


Cryl Ac + CrylF 



counterparts of the subject appltcaoon, or tts ^J^f*™^^ the subject 
nnderstood that 4. — o jdj-*- « — ^'X L subject culttne 
invention in derogation of patent ngnts panreo , t provisions of the 

deposits will be stored and made aval able to the P uM c m aecrrrt mm > f 
Budapest Treaty for the Deposit of Mtcroorgantsms te ftey vnl 

necessary to keep drem « and ™«^^f^^ ta a n^ for a period of at 
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The endotoxin production for some of .he B. **** — ™** 
3 was examined by sodium dodecy. sulfafc-po.yacrylamide gel dectrophotesis (SDS-PAGE) - 
described by Baum - a,., .990. E,ua. volume cultures of each B. -» w«e 

U m « medium - Amy sporulated and iysed. * — - - — - - * 
le/crystal pehet was washed twice with euua. voiumes of dHhd detomzed wate The W 
^Z^dedmb.fm.embamvo.umeofO.^TmonX-.OO- An et.ua. vo.nme of 

each washed culture was analyzed by SDS-PAGE as shown rn FIG. 2. 

fhemajoatyofbybndsmvo.vmsCty.AcaMCry.Ffortneds.b.ec^smB. 

u r«ni«S1 Two of the three hybrids involving CrylAc and CrylC, 
reciprocal exchange of EG11063. two or me / 

HO U and EG1.090, failed to produce crysud m B. »-» -J**^ 
reciprocal hybrids mimic the activated toxin fragments of crystal— EGH063 and 

^ve,, strain that was examined by SDS-PAGE produced some ,evel of 5-endotoxin As 
expect hlever, those cultures identified as crysta. negative produced very Me protetn (eg., 
^OUOoS lane. EG..067, lane , EO1.0S8, and lane , EGU090, Forte^e, 

SendotoxinsproducecomparablelevelsofproteinmclumngEGllOSOOaneb^bO. . 

depending on the parent 8-endotoxins used to construct the hybnd. 

63 Example 3 -Proteolytic Processing of the Hybrid 5-Endotoxins 

pIU degradation of me protoxin form of me 0— to a stable active « 
eceurs once a.ndotoxin crystals a« solubilized in the .arval midgut. One »«- 
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potential activity of 8-endotoxins is the stability of the active endotoxin in a proteose 
To test ft. proteolytic sensitivity of the hybrid ****** solubU^ toxt» was 
peered to trypsin digestion. The 8-endotoxins wete purified from spotted B. 
outturns and ,uanti fi ed as deseribed b, Charts at .991. ExaeUy 250 pg of each hybn 
endotoxin crysta, was solubuized in 30 tnM NaHC0 3 , 10 mM DTT (^volume 0.5 ml). 
Trypsin was added to the so.ubilized toxin a. a 1:10 ratio. At appropriate time potnts 50 pi 
aUouots were removed ,o 50 p! Uemmli buffer, heaftd to ,00-C for 3 mia, and frozen tn a 
d^-ice ethano. haft for subsetment analysis. The trypsin diges* of fte sombilized toxms v^ 
^yzed by SDS-PAGE and fte amount of active endotoxin at each time pain, was , pantified 
by densitometiy. A graphie rep—on of fte results from ftese stitdies are shown tn FIG 3. 
The wild-type CrylAc is rapidly processed ft fte active endotoxin fragment ftat 
for fte duration of fte study. The hybrid o-endotoxins from EG.1063 and HGUOM - 
atso processed to active 8-endotoxin frogmen* which are stable for fte duration of fte sftd * 
The processing of fte EG.1063 8-endotoxin occur, at a slower rate and ahigher percentage of 
^Lvec.Ltoxftf^entmmainsateachtimepoin, Alftough fte hybrid 8— 
from EG.1060 and EG1.062 ate process ,o active endotoxin fragments, ftese frar^ents - 
more susceptible ft fhrfter cleavage and degrade a, various rates during fte course of fte 
Tbe.exclgepoft.hetwecnC^cartdC^forfteEGUO^andEG.lO^ndo^ 

^tftftxinsftatdiffebyon^I amino acid residues (see FIG. 1). 
of maintaining CrylAc scuences a, ftese positions is evident by fte 

fte EG1.062 8-endotoxin. These daft demonstiaft fta, different hybnd 8— ^ 
constiucted using fte same parental 8-endotoxins can vary significantly m btochemtca. 
characteristics such as proteolytic stability. 

6.4 Example 4 - Bioaetivity of the Hybrid 8-Endot.xms 

B .Hunnpe^ cultures expressing fte desired 8-endotoxin were growr, until fully 
spomlaftd and lysed and washed as described in Example 2. The 8-endotoxin levels for each 
Ire were ratified by SDS-PAGE as described by Baum ,r a,., .990. tn fte case of 
oioassay screens, a singie appropriate concentration of each washed 8-endotoxin cmture was 
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topically applied to 32 wells containing 1.0 ml artificial diet per well (surface area of 175 mm ). 
A single neonate larvae was placed in each of the treated wells a*, the tray covered by a clear 
perfotated mylar sheet. Larvae mortality was scored after 7 days of feeding and percent 
mortality expressed as the ratio of the nnmber of dead lame to the total nomber of larvae treated 

(32). 

la the case of LC W determinations (8-endotoxin concenttation giving 50% mortaltty), 
Endotoxins were purified ftom fire B. thurin&nsU cultures and quantified as described by 
Chambers a, al, 1991. Eight concentrations of the Endotoxins were prepared by senal dtlutton 
to 0005% Triton X-100® and each concentration was topically applied to wells contemn* 1.0 
of artificial diet. Larva, mortality was scored after 7 days of feeding (32 larvae for each 
endotoxin concentration). In all eases the diluent served as the control. 

A comparison of the Cryl A/CrylF hybrid toxins by bioassay screens is shown m TABLE 
4 The hybrid Endotoxins ftom strains EOl 1063 and EOl 1074 maintain tire activities of the 
parental CrylAe and CrylF Endotoxins. Furthermore, the hybrid Endotoxin ftom EQU735 
stains ft. activity of its parental Cryl Ab and CrylF Endotoxins. The Endotoxins produce 
by strains EO11061, EG11062, EG11071, and EG11073 have no insecticidal acttvtty on the 
tosec. larva, tested despite 1) being comprised of at least on. parental Endotoxin that is active 
against tire indicated larvae aod 2) forming suable, weU-defined crystals to B. 
These results demonstrate the unpredictabl. nature of hybrid toxin constructions. 

For the data to TABLE 4. All strains were tested as washed sporulated cultures. For 
.ach insect tested, equivalent amounts of Endotoxins were used and insecticidal activity was 
based on the strain showing the highest percent mortality (++++). 
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10 



Strain 

CrylAc 



Bioassay Screens of Hybrid CrylA/CrylF S-Endotoxins 
~S.frugtpefda %eXigu<* ^viresce^: H^m ^V.™^, 



CrylAb 

EG 11 060 

EG11062 

EG11063 

EG11071 

EG 11073 

EG11074 

EG11090 

EG11091 

EG11092 

EG 11735 

EG11751 



++++ 



++++ 



+- l l + 



++++ 



N.D. a 



++++ 
++++ 



N.D. a 



N.D. a 
N.D. a 
N.D. a 
N.D. a 



a N D. = not determined. 

The 5-endotoxins described in FIG. 1 and that demonstrated insecticidal activity 
in bioassay sereens were tested as purified oystals «o determine their LC 5 „ (see TABIX 
5) The endotoxins purified from strains EG1 1063, EG. 1074. EG1 109X and EG1 1735 
a,, show increased armywon. »fiu*** - * -mpamd to any of 

,he wi.d-.ype 8-endotoxins Ml The EG1 .063 and EGU074 8-endotoxins would yreld 
identical active toxin fragment (refer .0 FIG. IB) which is evrdent by their sitm.ar 1X50 
values on me insect examined. An unexpected result evident from these data » that a 
hybrid endotoxin such as EG.1063. EG11092. EG11074, EG11735. or BOUTS! can 
retain the activity of .heir respective parental Endotoxins, and, against certam msects 
such as S. ecisua, can have activity far better man either parenta, Endotoxin. Thrs broad 
range of insecticidal activity at doses close ,0 or iower than the parent* 8-endotoxms, 
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10 



i, Wi* .he wud-rvpe le*el pI ^,on W ,^ 

,JL* »** for produeuon « IS S 

• w c frueiverda and S."e»g"a than its parental 

^endotoxin has better activity against S. frugiperaa .9 ^ 

„ ,: . . w 7ea activity attributable to its Cry 1 Ac 
^endotoxins, it has lost the H. vtrescens H. m ^ 

' , inwpr toxin Yield observed for the EG 11 091?' 

parent This restricted host range along with lower toxin yi.e ,, 

" « , ^ ™k e it less amenable to production in B. thuringiensis 
8-endotoxin (see Example 2) make it less amenau 

TABLE 5 

LC 50 Values for the Purified Hybrid 5-Endotpxin a 

IJ^Ts^^ 




17 

> 10000 

51 
7 



219 >10000 



nd 



ta TABLE 5, ,he LC s o values are expressed in uanograms of purified o-endofoxio 
per we,. (175 nV) aud aje «he eomposue v^ues for 2 ro 6 rep.icafious. ud = no. 
determined. 
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10 



15 



Plasmid 


-SEQ ID N 
O: 


pEG1065 


15 


P EG1067 


16 


pEG1068 


17 


pEG1070 


15 


pEG1072 


16 


pEG1074 


15 


pEG1076 


16 


pEG1077 


17 


pEG1088 


18 


pEG1089 


19 


pEG109l 


20 


pEGi092 


18 


pEG11093 




pEG365 


17 


P EG378 





TATCCAATTCGAACGTCATC 

TTTAGTCATCGATTAAATCA 

ATAATAAGAGCTCCAATGTT 

TATCCAATTCGAACGTCATC 

TTTAGTCATCGATTAAATCA 

TATCCAATTCGAACGTCATC 

TTTAGTCATCGATTAAATCA 

ATAATAAGAGCTCCAATGTT 

TACATCGTAGTGCAACTCTT 

TCATGGAGAGCTCCTATGTT 

TTAACAAGAGCTCCTATGTT 

TACATCGTAGTGCAACTCTT 

ATAATAAGAGCTCCAATGTT 



21 
21 
21 
19 
19 



22 



21 
21 
21 



ACTACCAGGTACCTTTGATG 

ACTACCAGGTACCTTTGATG; 

ACTACCAGGTACCTTTGATG 

TCATGGAGAGCTCCTATGTT 

TCATGGAGAGCTCCTATGTT 

TGCAACACTCGAGGCTGAAT 

TGCAACACTCGAGGCTGAAT 

TGCAACACTCGAGGCTGAAT 

ACTACCGGGTACCTTTGATA 



ACTACCAGGTACCTTTGATG 
ACTACCAGGTACCTTTGATG 
ACTACCAGGTACCTTTGATG 



TABLE 6 describes the DNA surrounding Ore 5' and V exchange points for the 
hyb „d S-endotoxms whtch are pertinent to the present rnvenrion. As evident hy the SEQ 
ro NO certain hyhrid Endotoxins share exchange sites. In certain instances, .he 
exchange site is indistmguishable from one of the parent Endotoxins. This is the case 
for the V exchange site for pEG 10 74, pEG.076. and P EO1077 and .before, they have 
not been assigned a separate sequence identifier (SEQ ID NO.). ' - 

To examine the effect of other small changes in the exchange ,.e chosen for 

♦t, o^Hvitv of EG1 175 1 and EG1 1063 on S. exigua and 
hybrid endotoxin construction, the activity oi but i id 

H lea were compared (TABLE 7,. The da. clearly show that hybrid 5-endorox.n 

^endotoxins. In this example, the exchange site in the EG 1 .75, Endotoxin was move4 
75 base pairs V compared to the EG! 1063 o-endo.oxi» and results in improved 
insecticida. activity. Although no significant improvement in S.e^ua actmty ts 
observed between EG1 1063 and EG11751. a significant improvement in H. zea achv, y 
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of ata.cs, 4-fold is observed for EG 1 1751. ft Is import to note that irnprovemenrs rn _ 
hybrid S-endoroxin bioactivty by altering exchange sites is unpredictable. In .be ease of 
BOHOA moving the exchange she 63. base. of the EG«0b3 e « site 
abolishes insecticidal activity as shown in TABLE 5. 

TABLE 7 
Bioactivity of EG11063 and EG11751 



B.t. Strain 



EG11063 
EG 11751 



LC 50 Valuesfor Washed Sporula ted Cultures 

H.zea 



S. exigua 



106 
90 



38 
10 



6.5 



Example 5 — Amino Acid Sequences of the Novel Crystal 
651 Amino Acid Sequence of the EG11063 Crystal Protein (SEQ ID NO:10) 

MetAspAsnAsnProAsnlleAsnGluCysIlePro^rAsnCysLeu 

SerAsnProGluValGluValLeuGlyGlyGluArglleGluThrGly 

TyrThrProIleAspIleSerLeuSerLeuThrGlnPheLeuLeuSer 

GluPheValProGlyAlaGlyPheValLeuGlyLeuValAspIlelle 

TrpGlyllePheGlyProSerGlnTrpAspAlaPheLeuValGlnlle 

GluGlnLeuIleAsnGlnArglleGluGluPheAlaArgAsnGlnAla 

IleSerArgLeuGluGlyLeuSerAsnLeuTyrGlnlleTyrAlaGlu 

SerPh^ArgGluTrpGluAlaAspProThrAsnProAlaLeuArgGlu 

GluMetArglleGlnPheAsnAspMetAsnSerAlaLeuThrThrAla 

IleProLeuPheAlaValGlnAsnTyrGlnValProLeuLeuSerVal 

TyrValGlnAlaAlaAsnLeuHisLeuSerValLeuArgAspValSer 

ValPheGlyGlnArgTrpGlyPheAspAlaAlaThflleAsnSerArg 

TyrAsnAspLeuThrArgLeuIleGlyAsnTyrThrAspTyrAlaVal 

ArgTrpTyrAsnThrGlyLeuGluArgValTrpGlyProAspSerArg 

AspTrpValArgTyrAsnGlnPheArgArgGluLeuThrLeuThrVal 

LeuAspIleValAlaLeuPheProAsnTyrAspSerArgArgTyrPro 

IleArgThrValSerGlnLeuThrArgGluIleTyrThrAsnProVal 
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LeuGluAsnPheAspGlySerPheArgGlySerAlaGlnGlyileGlu 

ArgSerlleArgSerProHisLeu^ 
IleTyrThrAspAlaHis^^ 

IleMetAlaSerProValGlyPlieS^rGlyProgiuPheThrBfeePro 

LeuTyrGlyThrMefcGl^^ . 
GlnLeuGlyGlnGlyValTyrArgThrLeuSerSezTThrLeuiyrArg.. 

ArgProPheAsnlleGlylleAsnAsnGlnGlnLeuSerValLeuAsp 

GlyThrGluPheAlaTyrGlyThrSerSerAsnLeuProSerAlaVal ? 

TyrArgLysSerGlyThrValAspSerLeuAspGluIleProProGln 

AsnAsnAsnValProProArgGlnGlyPheSerHisArgLeuSerHis 

ValSerMetPheArgSerGlyPheSerAsnSerSerValSerllelle 

ArgAlaProMetPheSerTrpThrHisArgSerAlaThrProThrAsn 

ThrlleAspProGluArglleThrGlnlleProLeuValLysAlaHis 

ThrLeuGlnSerGlyThrThrValValArgGlyProGlyPheThrGly. 

GlyAspIleLeuArgArgThrSerGlyGlyProPheAlaTyrThrlle 

ValAsnlleAsnGlyGlnLeuProGlnArgTyrArgAlaArglleArg 

TVrAlaSerThrThrAsnLeuArglleTyrValThrValAlaGlyGlu 

ArgllePheAlaGlyGlnPheAsnLysThrMetAspThrGlyAspPro 

LeuThrPheGlnSerPheSerTyrAlaThrlleAsnThrAlaPheThr 

PheProMetSerGlnSerSerPheThrValGlyAlaAspThrPheSer 

SerGlyAsnGluValTyrlleAspArgPheGluLeuIleProValThr 

AlaThrPheGluAlaGluTyrAspLeuGluArgAlaGlnLysAlaVal 

AsnAlaLeuPheThrSerlleAsnGlnlleGlylleLysThrAspVal 

ThrAspTyrHisIleAspGlnValSerAsnLeuValAspCysLeuSer 

AspGluPheCysLeuAspGluLysArgGluLeuSerGluLysValLys 

HisAlaLysArgLeuSerAspGluArgAsnLeuLeuGlnAspProAsn 

PheLysGlylleAsnArgGlnLeuAspArgGlyTrpArgGlySerThr 

AspIleThrlleGlnArgGlyAspAspValPheLysGluAsnTyrVal 

ThrLeuProGlyThrPheAspGluCysTyrProThrTyrLeuTyrGln 

LysIleAspGluSerLysLeuLysAlaPheThrArgTyrGlnLeuArg 

GlyTyrlleGluAspSerGlnAspLeuGluIleTyrLeuIleArgTyr 

AsnAlaLysHisGluThrValAsnValProGlyThrGlySerLeuTrp 

ProLeuSerAlaGlnSerProIleGlyLysCysGlyGluProAsnArg 
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CysAlaProHisLeuGluTrpAsnProAspLeuAspCys 
AspGlyGluLysCysAiaHisHisSerH^^ 

AspimlGlyCysto^ ^ 

PheLeuGluGluLysProLe^^ < 

ArgAlaGluLysLysTrpArgAspLysA^ 

ThrAsnlleValTyrLysGluAlaLysGluSerValAspAlaL^uPhe 

ValAsnSerGln^rAspGlnLeuGlnAlaAspThrAsnlleAlaMet 

IleHisAlaAlaAspLysArgValHisSerlleArgGluAlaiyrLeu 

ProGluLeuSerVallleProGlyValAsnAlaAlallePheGluGlu 

LeuGluGlyArgllePheThrAlaPheSerLeuTyrAspAlaArgAsn 

VallleLysAsnGlyAspPheAsnAsnGlyLeuSerCysTrpAsnVal 

LysGlyHlsValAspValGluGluGlnAsnAsnGlnArgSerValLeu 

ValValProGluTrpGluAlaGluValSerGlnGluValArgValCys 

ProGlyArgGlyryrlleLeuArgValThrAlaTyrLysGluGlyTyr 

GlyGluGlyCysValThrlleHisGluIleGluAsnAsnThrAspGlu 

LeuLysPheSerAsnCysValGluGluGluIleTyrProAsnAsn^r 

ValThrCysAsnAspTVrThrValAsnGlnGluGluTyrGlyGlyAla 

TyrThrSerArgAsnArgGlyTyrAsnGluAlaProSerValProAla 

AspTyrAlaSerValTyrGluGluLysSerTyrThrAspGlyArgArg 

GluAsnProCysGluPheAsnArgGlyTyrArgAspTyrThrProLeu 

ProValGlyTyrValThrLysGluLeuGluTyrPheProGluThrAsp 

LysValTrpIleGluIleGlyGluThrGluGlyThrPhelleValAsp 

SerVaJjGluLeuLeuLeuMetGluGlu 

AMINO ACID SEQUENCE OF THE EG11074 CRYSTAL PROTEIN (SEQ ID NO 

MetAspAsnAsnProAsnlleAsnGluCysIleProTyrAsnCysLeu 

SerAsnProGluValGluValLeuGlyGlyGluArglleGluThrGly 
TyrThrProIleAspIleSerLeuSerLeuThrGlnPheLeuLeuSer 
GluPheValProGlyAlaGlyPheValLeuGlyLeuValAspIlelle 
TrpGlyllePheGlyProSerGlnTrpAspAlaPheLeuValGlnlle 
GluGlnLeuIleAsnGlnArglleGluGluPheAlaArgAsnGlnAla 
IleSerArgLeuGluGlyLeuSerAsnLeuTyrGlnlleTyrAiaGlu 
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^ 3 



SerPheArgGluTrpGluAlaAspProThrAsnProAlaLeuArgGlu 
GluMetArglleGlnPheAsnAspMetAsnSerAlaLeuThrThrAla 
IleProLeuPheAlaValGlnAsnTyrGlnValProLeuLeuSer\?al . 

TyrValGlnAlaAlaAsn^^ 
ValPheGlyGlnArgTrpGlyPheAspAlaAla^ 

TyrAsnAspLeuThrArgLeuI^ 

ArgTrpTyrAsnThrGlyLeuGluArgValTrpGlyProAspSerArg 
AspTrpValArgTyrAsnGlnPheArgArgGluLeuThrLeuThrVal 

LeuAspI 1 eValAlaLeuPheProAsnTyr AspSerArgArgTyrPro 

IleArgThrValSerGlnLeuThrArgGluIleTyrThrAsnProVal 

LeuGluAsnPheAspGlySerPheArgGlySerAlaGlnGlylleGlu 

ArgSerlleArgSerProHisLeuMetAspIleLeuAsnSerlleThr 

IleTyrThrAspAlaHisArgGlyTyrTyrTyrTrpSerGlyHisGln 

IleMetAlaSerProValGlyPheSerGlyProGluPheThrPhePro. 

LeuTyrGlyThrMetGlyAsnAlaAlaProGlnGlnArglleValAla 

GlnLeuGlyGlnGlyValTyrArgThrLeuSerSerThrLeuTyrArg 

ArgProPheAsnlleGlylleAsnAsnGlnGlnLeuSerValLeiiAsp 

GlyThrGluPheAlaTyrGlyThrSerSerAsnLeuProSerAlaVal 

TyrArgLysSerGlyThrValAspSerLeuAspGluIleProProGln 

AsnAsnAsnValProProArgGlnGlyPheSerHisArgLeuSerHis 

ValSerMetPheArgSerGlyPheSerAsnSerSerValSerllelle 

ArgAlaProMetPheSerTrpThrHisArgSerAlaThrProThrAsn 

ThrlleAspProGluArglleThrGlnlleProLeuValLysAlaHis 

ThrLeuQlnSerGlyThrThrValValArgGlyProGlyPheTErGly 

GlyAspIleLeuArgArgThrSerGlyGlyProPheAlaTyrThrlle 

ValAsnlleAsnGlyGlnLeuProGlnArgTyrArgAlaArglleArg 

TyrAlaSerThrThrAsnLeuArglleTyrValThrValAlaGlyGlu 

ArgllePheAlaGlyGlnPheAsnLysThrMetAspThrGlyAspPro 

LeuThrPheGlnSerPheSerTyrAlaThrlleAsnThrAlaPheThr 

PheProMetSerGlnSerSerPheThrValGlyAlaAspThrPheSer 

SerGlyAsnGluValTyrlleAspArgPheGluLeuIleProValThr 

AlaThrLeuGluAlaGluTyrAsnLeuGluArgAlaGlnLysAlaVal 

AsnAlaLeuPheThrSerThrAsnGlnLeuGlyLeuLysThrAsnVal 
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ThrAspTyrHisIleAspGlnValSerAs^ 
AspGluPheCysLeuAspGluLysArgGluLeuSer^uLysValLys 

HisAia'LysArgLeuSerAspGluArgAspL^uLeu^ ' 

PkeLysAspIleA>nA^^ , : 
GlylleThrlleGinGlyGlyAspAspy^ ; ' 

^hrLeuSer^ 

LysIleAspGluSerLysLeuLysAlaPheThrArgTyrGlnLeuArg 

GlyTyrlleGluAspSerGlnAspLeuGluIleTyrLeuJleArgTyr 

AsnAlaLysHisGluThrValAsnV^iProGlyTKrGlySerLeuTrp 

ProLeuSerAlaGlnSerProIleGlyLysCysGlyGluProAsnArg 

CysAlaProHisLeuGluTrpAsnProAspLeuAspCysSerCysArg 

AspGlyGluLysCysAlaHisHisSerHisHisPheSerLeuAspIle 

AspValGlyCysThrAspLeuAsnGluAspLeuGlyyalTrpVallle ' 

PheLysIleLysThrGlnAspGlyHisAlaArgLeuGlyAsriLeuGlu 

PheLeuGluGluLysProLeuValGlyGluAlaLeuAlaArgValLys 

ArgAlaGluLysLysTrpArgAspLysArgGluLysLeuGluTrpGlu 

ThrAsnlleValTyrLysGluAlaLysGluSerValAspAlaLeuPhe 

ValAsnSerGlnTyrAspGlnLeuGlnAlaAspThrAsnlleAlaMet 

IleHisAlaAlaAspLysArgValHisSerlleArgGluAlaTyrLeu 

ProGluLeuSerVa 1 I 1 eProGlyValAsnAlaAlal lePheGluGlu 

LeuGluGlyArgllePheThrAlaPheSerLeuTyrAspAlaArgAsn 

VallleLysAsnGlyAspPheAsnAsnGlyLeuSerCysTrpAsnVal 

LysGlyHisValAspValGluGluGlnAsnAsnGlnArgSerValLeu 

ValValProGluTrpGluAlaGluValSerGlnGluValArgValCys 

ProGlyArgGlyTyrIleLeuArgValThrAlaTyrLysGluGiyTyr_ 

GlyGluGlyCysValThrlleHisGluIleGluAsnAsnThrAspGlu 

LeuLysPheSerAsnCysValGluGluGluIleTyrProAsnAsnThr 

ValThrCysAsnAspTyrThrValAsnGlnGluGluTyrGlyGlyAla 

TyrThrSerArgAsnArgGlyTyrAsnGluAlaProSerValProAla 

AspTyrAlaSerValTyrGluGluLysSerTyrThrAspGlyArgArg 

GluAsnProCysGluPheAsnArgGlyTyrArgAspTyrThrProLeu 

ProValGlyTyrValThrLysGluLeuGluTyrPheProGluThrAsp 

LysValTrpIleGluIleGlyGluThrGluGlyThrPhelleValAsp 
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SerValGluLeuLeuLeuMetGluGlu 



AminOAcid Sequence of the EGU735 Costal Protein (SEQ ID 

MetAspAsnAsnProAsnlleAsnGluCysIleProTyrAsnCysL^ 

SerAsnProGluValGluValLeuGlyGl^^ 
TyrThrProIleAspIleSerLeuSerLeuThrGlnPheLeuLeuSer 

GluPheValProGlyAlaGlyPheValLeuGlyLeuValAspIlelle 

TrpGlyllePheGlyProSerGlnTrpAspAlaPheLeuValGlnlle 

GluGlnLeuIleAsnGlnArglleGluGluPheAlaArgAsnGlnAla 

IleSerArgLeuGluGlyLeuSerAsnLeuTyrGlnlleTyrAlaGlu 

SerPheArgGluTrpGluAlaAspProThrAsnProAlaLeuArgGlu 

GluMetArglleGlnPheAsnAspMetAsnSerAlaLeuThrThrAla 

IleProLeuPheAlaValGlnAsnTyrGlnValProLeuLeuSerVal 

TyrValGlnAlaAlaAsnLeuHisLeuSerValLeuArgAspValSer 

ValPheGlyGlnArgTrpGlyPheAspAlaAlaThrlleAsnSerArg 

TyrAsnAspLeuThrArgLeuIleGlyAsnTyrThrAspHisAlaVal 

ArgTrpTyrAsnThrGlyLeuGluArgValTrpGlyProAspSerArg 

AspTrpIleArgTyrAsnGlnPheArgArgGluLeuThrLeuThrVal 

LeuAspIleValSerLeuPheProAsnTyrAspSerArgThrTyrPro 

IleArgThrValSerGlnLeuThrArgGluIleTyrThrAsnProVal 

LeuGluAsnPheAspGlySerPheArgGlySerAlaGlnGlylleGlu 

GlySerlleArgSerProHisLeuMetAspIleLeuAsnSerlleThr 

IleTyrThrAspAlaHisArgGlyGluTyrTyrTrpSerGlyHisGln 

IleMetAlaSerProValGlyPheSerGlyProGluPheThrPhePro 

LeuTyrGlyThrMetGlyAsnAlaAlaProGlnGlnArglleValAla 

GlnLeuGlyGlnGlyValTyrArgThrLeuSerSerThrLeuTyrArg 

ArgProPheAsnlleGlylleAsnAsnGlnGlnLeuSerValLeuAsp 

GlyThrGluPheAlaTyrGlyThrSerSerAsnLeuProSerAlaVal 

TyrArgLysSerGlyThrValAspSerLeuAspGluIleProProGln 

AsnAsnAsnValProProArgGlnGlyPheSerHisArgLeuSerHis 

ValSerMetPheArgSerGlyPheSerAsnSerSerValSerllelle 

ArgAlaProMetPheSerTrpThrHisArgSerAlaThrProTTTrAsn 
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ThrlleAspProGluArglleThrGlnlleProLeuValLysAlaHis,, 

^rLeuGlnSerGlyThrTKrValValArgGlyProGlyPheteGly 

GlyAspIleLeuArgArgThrSerGlyGlyProPheAl^rThrll^ 

TyrAlaSerThrThrAsnLeuArgl^^ 

ArgllePheAiaGlyGlnP^ . 
LeuThrPheGlnSerPheSerTyrAlaThrlleAsnThrAlaPheThr 

PheProMetSerGlnSerSerPheThrValGlyAlaAspThrPheSer 

SerGlyAsnGluValTyrlleAspArgPheGluLeulleProValThr 

AlaThrPheGluAlaGluTyrAspLeuGluArgAlaGlnLysAlaVal 

AsnAlaLeuPheThrSerlleAsnGlnlleGlylleLysThrAspVal 

ThrAspTyrHisIleAspGlnValSerAsnLeuValAspCysLeuSer ■ 

AspGluPheCysLeuAspGluLysArgGluLeuSerGluLysValLys 

HisAlaLysArgLeuSerAspGluArgAsnLeuLeuGlnAspProAsn 

PheLysGlylleAsnArgGlnLeuAspArgGlyTrpArgGlySerThr 

AspIleThrlleGlnArgGlyAspAspValPheLysGluAsnTyrVal 

ThrLeuProGlyThrPheAspGluCysTyrProThrTyrLeuTyrGln 

LysIleAspGluSerLysLeuLysAlaPheThrArgTyrGlnLeuArg 

GlyTyrlleGluAspSerGlnAspLeuGluIleTyrLeuIleArgTyr 

AsnAlaLysHisGluThrValAsnValProGlyThrGlySerLeuTrp 

ProLeuSerAlaGlnSerProIleGlyLysCysGlyGluProAsnArg 

CysAlaProHisLeuGluTrpAsriProAspLeuAspCysSerCysArg 

AspGlyGluLysCysAlaHisHisSerHisHisPheSerLeuAspIle 

AspValGlyCysThrAspLeuAsnGluAspLeuGlyValTrpVallle 

PheLysIleLysThrGlnAspGlyHisAlaArgLeuGlyAsnLeuGlu_ 

PheLeuGluGluLysProLeuValGlyGluAlaLeuAlaArgValLys 

ArgAlaGluLysLysTrpArgAspLysArgGluLysLeuGluTrpGlu 

ThrAsnlleValTyrLysGluAlaLysGluSerValAspAlaLeuPhe 

ValAsnSerGlnTyrAspGlnLeuGlnAlaAspThrAsnlleAlaMet 

IleHisAlaAlaAspLysArgValHisSerlleArgGluAlaTyrLeu 

ProGluLeuSerVallleProGlyValAsnAlaAlallePheGluGlu 

LeuGluGlyArgllePheThrAlaPheSerLeuTyrAspAlaArgAsn 

VallleLysAsnGlyAspPheAsnAsnGlyLeuSerCysTrpAsnVal 
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LysGlyHisValAspValGluGiuGlnAsnAsnGlxiArgSerValLeu 
ValValProGluTrpGluAlaGluValSefGlnGluyalArgValCys 
ProGlyArgGlyTyrlleLeuArgValT^AlaTyrLyisGluGlyTyr 
GlyGluGlyCysValThrl leHisGluIleGluAsriAsiaThr 
LeuLysPheSerAsnCysValGluGluG^ 
ValThrCysAsnAspTyrTh^^^ 

TyrThrSerArgAsnArgGlyTyrAsnGlioAlaProSerValPrbAia 
AspTyrAlaSerValTyrGluGluLysSerTyrThrAspGlyArgArg 
GluAsnProCysGluPheAsnArgGlyTyrArgAspTyrTKrProLeu 
ProValGlyTyrValThrLysGluLeuGluTyrPheProGluThrAsp 
LysValTrpIleGluIleGlyGluThrGluGlyThrPhelleValAsp 

SerValGluLeuLeuLeuMetGluGlu 

5.4 AMINO ACID SEQUENCE OF THE EG11092 CRYSTAL PROTEIN (SEQ ID NO:26) 

MetAspAsnAsnProAsnlleAsnGluCysIleProTyrAsnCysLeu 

SerAsnProGluValGluValLeuGlyGlyGluArglleGluThrGly 

TyrThrProIleAspIleSerLeuSerLeuThrGlnPheLeuLeuSer 

GluPheValProGlyAlaGlyPheValLeuGlyLeuValAspIlelle 

TrpGlyllePheGlyProSerGlnTrpAspAlaPheLeuValGlnlle 

GluGlnLeuIleAsnGlnArglleGluGluPheAlaArgAsnGlnAla 

IleSerArgLeuGluGlyLeuSerAsnLeuTyrGlnlleTyrAlaGlu 

SerPheArgGluTrpGluAlaAspProThrAsnProAlaLeuArgGlu 

GluMetArglleGlnPheAsnAspMetAsnSerAlaLeuThrThrAla 

IleProLeuPheAlaValGlnAsnTyrGlnValProLeuLeuSerVal 

TyrValGlnAlaAlaAsnLeuHisLeuSerValLeuArgAspValSer. 

ValPheGlyGlnArgTrpGlyPheAspAlaAlaThrlleAsnSerArg 

TyrAsnAspLeuThrArgLeuIleGlyAsnTyrThrAspHisAlaVal 

Ar gTrpTyr AsnThrG ly LeuG luArgVa 1 TrpG ly Pr o AspS er Ar g 
AspTrpIleArgTyrAsnGlnPheArgArgGluLeuThrLeuThrVal 
LeuAspIleValSerLeuPheProAsnTyrAspSerArgThrTyrPro 
IleArgThrValSerGlnLeuThrArgGluIleTyrThrAsnProVal 
LeuGluAsnPheAspGlySerPheArgGlySerAlaGlnGlylleGlu 
ArgSerlleArgSerProHisLeuMetAspIleLeuAsnSerlleThr 
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IleTyrThrAspAlaHisArgGlyTyrTyrTyrTrpSerGiyHisGln 
IleMetAlaSerProValGlyPheSerGlyProGiuPheThrPhePrb 

LeuTyrGlyThzMetG^^ 

' GlnLeuGlyGl^ 
5 ArgProPheAsnlleGlylleAsnAsnGlnGlnLeuSerValLeliAsp 

.. GlyThrGluPheAlaTyrGlyThrSerSerAsnLeuProSerAlaWal 

TyrArgLysSerGlyThrValAspSerLeuAspGluIleProProGln 

AsnAsnAsnValProProArgGlnGlyPheSerHisArgLeuSerHis 

. ; ValSerMetPheArgSerGlyPheSerAsnSerSerValSerllelle 

10 ArgAlaProMetPheSerTrpThrHisArgSerAlaThrProThrAs^ 
ThrlleAspProGluArglleThrGlnlleProLeuValLysAlaHis 
ThrLeuGlnSerGlyThrThrValValArgGlyPrOGlyPheThrGly 
GlyAspIleLeuArgArgThrSerGlyGlyProPheAlaTyrThrlle 
ValAsnlleAsnGlyGlnLeuProGlnArgTyrArgAlaArglleArg 

15 TyrAlaSerThrThrAsnLeuArglleTyrValThrValAiaGlyGlu 
ArgllePheAlaGlyGlnPheAsnLysThrMetAspThrGlyAspPro 
LeuThrPheGlnSerPheSerTyrAlaThrlleAsnThrAlaPheThr 
PheProMetSerGlnSerSerPheThrValGlyAlaAspThrPheSer 
SerGlyAsnGluValTyrlleAspArgPheGluLeuileProValThr 

20 AlaThrPheGluAlaGluTyrAspLeuGluArgAlaGlnLysAlaVal 
AsnAlaLeuPheThrSerlleAsnGlnlleGlylleLysThrAspVal 
ThrAspTyrHisIleAspGlnValSerAsnLeuValAspCysLeuSer 
AspGluPheCysLeuAspGluLysArgGluLeuSerGluLysVa^Lys 
HisAlaLysArgLeuSerAspGluArgAsnLeuLeuGlnAspProAsn 

25 pheLysGlylleAsnArgGlnLeuAspArgGlyTrpArgGlySerThr 
Asp ileThrIleGlnArgGlyAspAspValPheLysGluAsnTyrVal 
ThrLeuProGlyThrPheAspGluCysTyrProThrTyrLeuTyrGln 
LysIleAspGluSerLysLeuLysAlaPheThrArgTyrGlnLeuArg 
GlyTyrlleGluAspSerGlnAspLeuGluIleTyrLeuIleArgTyr 

30 AsnAlaLysHisGluThrValAsnValProGlyThrGlySerLeuTrp 
ProLeuSerAlaGlnSerProIleGlyLysCysGlyGluProAsnArg 
CysAlaProHisLeuGluTrpAsnProAspLeuAspCysSerCysArg 
AspGlyGluLysCysAlaHisHisSerHisHisPheSerLeuAspIle 
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AspValCjlyCysThrAspLeuAsnG 

PheLysIleLysThrGlnAspGlyHisAlaArgLeuGlyAsnl,euGlu 

PheLeuGluGluLysProLeuValGlyGluA^ 
ArgAlaGluLysLysTrpArgAspLysArgG^ 

ThrAsnlleVal^ 

ValAsnSerGlnTyrAspGlnLeuGlnAiaA^pThrAsni LeAla*^ 

IleHisAlaAlaAspLysArgValHisSerlleArgGluAlaTyrLeu 

ProGluLeuSerVallleProGlyValAsnAlaAlallePheGluGlu 

LeuGluGlyArgllePheThrAlaPheSerLeuTyrAspAlaArgAsn 

VallleLysAsnGlyAspPheAsnAsnGlyLeuSerCysTrpAsnVal 

LysGlyHisValAspValGluGluGlnAsnAsnGlnArgSerValLeu 

ValValProGluTrpGluAlaGluValSerGlnGluValArgValCys 

ProGlyArgGlyTyrlleLeuArgValThrAlaTyrLysGluGlylVr. 

GlyGluGlyCysValThrlleHisGluIleGluAsnAsnThrAspGlu 

LeuLysPheSerAsnCysValGluGluGluIleTyrPrbAsnAsn'Tllr 

ValThrCysAsnAspTyrThrValAsnGlnGluGluTyrGlyGlyAla 

TyrThrSerArgAsriArgGlyTyrAsnGluAlaProSerValProAla 

AspTyrAlaSerValTyrGluGluLysSerTyrThrAspGlyArgArg 

GluAsnProCysGluPheAsnArgGlyTyrArgAspTyrThrProLeu 

ProValGlyTyrValThrLysGluLeuGluTyrPheProGluThrAsp 

LysValTrpIleGluIleGlyGluThrGluGlyThrPhelleValAsp 

SerValGluLeuLeuLeuMetGluGlu 

5.5 Amino Acid Sequence of the EG11751 Crystal ProteinIsEQ EC 

MetAspAsnAsnProAsnlleAsnGluCysIleProTyrAsnCysLeu 

SerAsnProGluValGluValLeuGlyGlyGluArglleGluThrGly 
TyrThrProIleAspIleSerLeuSerLeuThrGlnPheLeuLeuSer 
GluPheValProGlyAlaGlyPheValLeuGlyLeuValAspIlelle 
TrpGlyllePheGlyProSerGlnTrpAspAlaPheLeuValGlnlle 
GluGlnLeuIleAsnGlnArglleGluGluPheAlaArgAsnGlnAla 
IleSerArgLeuGluGlyLeuSerAsnLeuTyrG.lnlleTyrAlaGlu 
SerPheArgGluTrpGluAlaAspProThrAsnProAlaLeuArgGlu 
GluMetArglleGlnPheAsnAspMetAsnSerAlaLeuThrThrAla 
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IleProLeuPheAlaValGlnAsnTyrGl.nValProLeuLeuSerVal 
TyrValGlnAlaAlaAsnLeuHisLeuSerValLeuArgAspValSer 
ValPheGlyGlnArgTrpGlyPheAspAlaAlathrlleAshSerAr^ 

Tyr AsnAspLeuMrArgfceuI aeGi^^i#^^.^A^\fea 
ArgTrpTyrAsnThrGlyLeuGluArgValTtpGlyPrciAspSer 

AspTrpValArgTyrAsriGlnPheArgArgGluDeu 

LeuAspIleValAlaLeuPheProAsnTyrAspSerArgArgTyrPro 

IleArgThrValSerGlnLeuThrArgGluIleTyrThrAsnProVal 

LeuGluAsnPheAspGlySerPheArgGlySerAlaGlnGlylleGlu 

ArgSerlleArgSerProHisLeiiMetAspIleLeuAsnSerlleThr 

IleTyrThrAspAlaHisArgGlyTyrTyrTyrTrpSerGlyHisGln 

IleMetAlaSerProValGlyPheSerGlyProGluPhefhrPKePro 

LeuTyrGlyThrMetGlyAsnAlaAlaProGlnGlnArglleValAla 

GlnLeuGlyGlnGlyValTyrArgThrLeuSerSerThrLeuTyrArg 

ArgPr oPheAsnl leGlyll eAsriAsnG InG lriLeiiS erVa lLeuAsp 
GlyThrGluPheAlaTyrGlyThrSerSerAsnLeuProSerAlaVal 
TyrArgLysSerGlyThrValAspSerLeuAspGluIleProProGln 
AsnAsnAsnValProProArgGlnGlyPheSerHisArgLeuSerHis 
ValSerMetPheArgSerGlyPheSerAsnSerSerValSerllelle 
ArgAlaProMetPheSerTrpIleHisArgSerAlaGluPheAsnAsn 
IlelleAlaSerAspSerlleThrGlnlleProLeuValLysAlaHis 
ThrLeuGlnSerGlyThrThrValValArgGlyProGlyPheThrGly 
G lyAsp I 1 eLeuAr gAr gThr Ser GlyG ly Pr o PheAl aTy r Thr I 1 e 
ValAsnlleAsnGlyGlnLeuProGlnArgTyrArgAlaArglleArg 
TyrAlaSerThrThrAsnLeuArglleTyrValThrValAlaGlyGlu 
ArgllePheAlaGlyGlnPheAsnLysThrMetAspThrGlyAspPro 
LeuThrPheGlnSerPheSerTyrAlaThrlleAsnThrAlaPheThr 
PheProMetSerGlnSerSerPheThrValGlyAlaAspThrPheSer 
SerGlyAsnGluValTyrlleAspArgPheGluLeuIleProValThr 
AlaThrPheGluAlaGluTyrAspLeuGluArgAlaGlnLysAlaVal 
AsnAlaLeuPheThrSerlleAsnGlnlleGlylleLysThrAspVal 
ThrAspTyrHisIleAspGlnValSerAsnLeuValAspCysLeuSer 
AspGluPheCysLeuAspGluLysArgGluLeuSerGluLysValLys 
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HisAlaLysArgLeuSerAspGluArgAsnLeuLei;Gl^spPrpAsn 

PheLysGlylleAsnArgGlnL^ 
AspIleThrlleGlnArgGlyAspAspValPheLy^ 

ThrLeuPxoGiyTfirpkeAspGi^ 
5 LysIleAspGluSerLysLeuLysAlaPheThrArgTyrGlnneUArg 

GlyTyrlleGluAspSerGlnAspL^ 

AsnAlaLysHisGluThrValAsnValProGlyThrGlySerLeuTrp 
ProLeuSerAlaGlnSerProIleGlyLysCysGlyGluProAsnArg 
CysAlaProHisLeuGluTrpAsnProAspLeuAspCysSerCysArg 

10 AspGlyGluLysCysAlaHisHisSerHisHisPheSerLeuAspIle 
AspValGlyCysThrAspLeuAsnGluAspLeuGlyValTrpVallle 
PheLysIleLysThrGlnAspGlyHisAlaArgLeuGlyAsnLeuGlu 
PheLeuGluGluLysProLeuValGlyGl\aAlaLeuAlaArgValLys 
ArgAlaGluLysLysTrpArgAspLysArgGluLysLeuGluTrpGlu 

15 ThrAsnlleValTyrLysGluAlaLysGluSerValAspAlaLeuPlie 
ValAsnSerGlnTyrAspGlnLeuGlnAlaAspThrAsnlleAlaMet 
IleHisAlaAlaAspLysArgValHisSerlleArgGluAlaTyrLeu 
ProGluLeuSerVallleProGlyValAsnAlaAlallePheGluGlu 
LeuGluGlyArgllePheThrAlaPheSerLeuTyrAspAlaArgAsn 

20 vallleLysAsnGlyAspPheAsnAsnGlyLeuSerCysTrpAsnVal 
LysGlyHisValAspValGluGluGlnAsnAsnGlnArgSerValLeu 
ValValProGluTrpGluAlaGluValSerGlnGluValArgValCys 
ProGlyArgGlyTyrlleLeuArgValThrAlaTyrLysGluGlyTyr 
GlyGluGlyCysValThrlleHisGluIleGluAsnAsnThrAspGlu 

25 LeuLysPheSerAsnCysValGluGluGluIleTyrProAsnAsnThr^ 
ValThrCysAsnAspTyrThrValAsnGlnGluGluTyrGlyGlyAla 
TyrThrSerArgAsnArgGlyTyrAsnGluAlaProSerValProAla 
AspTyrAlaSerValTyrGluGluLysSerTyrThrAspGlyArgArg 
GluAsnProCysGluPheAsnArgGlyTyrArgAspTyrThrProLeu 

30 p r oValGlyTyrValThrLysGluLeuGluTyrPheProGluThrAsp 
LysValTrpIleGluIleGlyGluThrGluGlyThrPhelleValAsp 
SerValGluLeuLeuLeuMetGluGlu 
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.6 Amino Acid Sequence of the EG11090 Crystal Protein (SEQ ID NO:30) 

MetAspAsnAsnProAsnlleAsnGluCysIleProTy^ ; . , 

LeuSer AsrtPrqGluValGluVall^ ■ • f - , ■ 

ThrGlyTyrThrProIleAspIleSerLeuS^ - K r\ . ■ : . 

LeuLeuSerGluPheValProGlyAlaGW ' ,. ".: Y- 

... .valAspIlelleTrpGlyllePheGlyProSerGlnTrpAspAla 
PheLeuValGlnlleGluGlnLeuIleAsnGlnArglXeGluGlu 
PheAlaArgAsnGlnAlalleSerArgLeuGluGlyL'euSerAsn 
LeuTyrGlnlleTyrAlaGluSerPheArgGluTrpGluAlaAsp 
ProThrAsnProAlaLeuArgGluGluMetArglleGlnPheAsn 
AspMetAsnSerAlaLeuThrThrAlalleProLeuPheAlaVal 
GlnAsnTyrGlnValProLeuLeuSerValTyrValGlnAlaAla 
AsnLeuHisLeuSerValLeuArgAspValSerValPheGlyGln 
ArgTrpGlyPheAspAlaAlaThrlleAsnSerArgTyrAsnAsp 
LeuThrArgLeuIleGlyAsnTyrThrAspTyrAlaValArgTrp 
TyrAsnThrGlyLeuGluArgValTrpGlyProAspSerArgAsp 
TrpValArgTyrAsnGlnPheArgArgGluLeuThrLeuThrVal 
LeuAspIleValAlaLeuPheProAsnTyrAspSerArgArgTyr 
ProIleArgThrValSerGlnLeuThrArgGluIleTyrThrAsn 
ProValLeuGluAsnPheAspGlySerPheArgGlySerAlaGln 
GlylleGluArgSerlleArgSerProHisLe-uMetAspIleLeu 
AsnSerlleThrlleTyrThrAspAlaHisArgGlyTyrTyrTyr 
TrpSerGlyHisGlnlleMetAlaSerProValGlyPheSerGly 
ProGluPheThrPheProLeuTyrGlyThrMetGlyAsnAlaAla _ 
ProGlnGlnArglleValAlaGlnLeuGlyGlnGlyValTyrArg 
ThrLeuSerSerThrLeuTyrArgArgProPheAsnlleGlylle 
AsnAsnGlnGlnLeuSerValLeuAspGlyThrGluPheAlaTyr 
GlyThrSerSerAsnLeuProSerAlaValTyrArgLysSerGly 
ThrValAspSerLeuAspGluIleProProGlnAsnAsnAsnVal 
ProProArgGlnGlyPheSerHisArgLeuSerHisValSerMet 
PheArgSerGlyPheSerAsnSerSerValSerllelleArgAla 
ProMetPheSerTrpIleHisArgSerAlaThrLeuThrAsnT'Br 
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I leAspProGluArgl leAsnGlnl lePf oLeuValLy^GjLyPtte 
ArgValTrpGlyGlyThrSerVallleThrGlyProGiyPKeThr 
GlyGlyAspIleLeuArgA^gAsnT|r f PheGlyAspPhey;a 

LeuGiri^ ' 
ArgPheArgTyrAlaSerSerArgAspAlaArgVallleValLeu 

Tk^lyAlaAla^erThrGly^aiGlyGi-|G ; ; 

MetProLeuGlnLysThrMetGluIleGlyGluAsnLeuThrSer 

ArgThrPheArgTyrThrAspPheSerAsnProPheSerPheArg 

AlaAsnProAspilelleGlyHeSerGluGlriProDeuPheGly 

AlaGlySerlleSerSerGlyGluLeuTyrlleAspLysIleGlu 

IlelleLeuAlaAspAlaThrPheGluAlaGluSerAspLeuGlu 

ArgAlaGlnLysAlaValAsnAlaLeuPheThrSerSerAsnGln 

1 1-eGlyLeuLysThr AspValThrAspTyrHis I leAspGlnVal 

SerAsnLeuValAspCysLeuSerAspGluPheCysLeuAspGlu 

LysArgGluLeuSerGluLysValLysHisAlaLysArgLeuSer 

AspGlxiArgAsnLeuLeuGlnAspProAsnPheArgGlylieAsn 

ArgGlnProAspArgGlyTrpArgGlySerThrAspIleThrlle 

GlnGlyGlyAspAspValPheLysGluAsnTyrValThrLeuPro 

GlyThrValAspGluCysTyrProThrTyrLeuTyrGlnLysIle 

AspGluSerLysLeuLysAlaTyrThrArgTyrGluLeuArgGly 

TyrlleGluAspSerGlnAspLeuGluIleTyrLeuIleArgTyr 

AsnAlaLysHisGluIleValAsnValProGlyThrGlySerLeu 

TrpProLeuSerAlaGlnSerProIleGlyLysCysGlyGluPro 

AsnArgCysAlaProHisLeuGluTrpAsnProAspLeuAspCys 

SerCysArgAspGlyGluLysCysAlaHisHisSerHisHisPhe 

ThrLeuAspIleAspValGlyCysThrAspLeuAsnGluAspLeu 

GlyValTrpVa 1 1 1 ePheLys 1 1 eLysThrGlnAspGlyHi s Ala 
ArgLeuGlyAsnLeuGluPheLeuGluGluLysProLeuLeuGly 
GluAlaLeuAlaArgValLysArgAlaGluLysLysTrpArgAsp 
LysArgGluLysLeuGlnLeuGluThrAsnlleValTyrLysGlu 
AlaLysGluSerValAspAlaLeuPheValAsnSerGlnTyrAsp 
ArgLeuGlnValAspThrAsnlleAlaMetlleHisAlaAlaAsp 
LysArgValHisArglleArgGluAlaTyrLeuProGluLeuSer 
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Vallle ProGlyVa lAsnAlaAlal le PheG luGluLeiiGluGly . 
Ar g lie PheThr Al aTyr Ser LeiiTyr AspA 1 aAr gAsrlValT le 
LysAsnGlyAspPheAsriAsnGlyLeuLeuCys^rpA-snValLys •;; : ; , V ; ? ; 
GlyHisVaiAspValGLuGfuGlnAsnAsnHlsArgSerValLeu 
5 VallleProGluTrpGluAlaGluValSerGlnGluValArgVal .V. -.- ; vl V 

CysProGlyArgGiyfW . ;i ■ ■„,^.' : „ 

GlyTyrGlyGluGlyCysValThrlleHisGlutleGluAspAsn 
ThrAspGluLeuLysPheSerAsnCysValGluGluGluValTyr 
• ProAsnAsnThrValThrCysAsnAs-nTyrThrGlyThrGlnGlu 

10 QluTyrGluGlyThrTyrThrSerArgAsnGlnGlyTyrAspGlu 
AlaTyrGlyAsnAsnProSerValProAlaAspTyrAlaSerVal 
TyrGluGluLysSerTyrfhrAspGlyArgArgGluAsnProCys 
GluSerAsnArgG.lyTyrGlyAspTyrThrProLeuProAlaGly 
TyrValThrLysAspLeuGluTyrPheProGluThrAspLysVal 

15 TrpIleGluIleGlyGluThrGluGlyThrPhelleValAspSer 
ValGluLeuLeuLenMetGluGlu 

6.6 Example 6 - DNA Sequences Encoding the Novel Crystal Proteins 
6.6.1 DNA Sequence Encoding the EG11063 Crystal Protein (SEQ ID -\ t O:9) 

20 ATG GAT AAC AAT CCG AAC ATC AAT GAA 

AGT AAC CCT GAA GTA GAA GTA TTA GGT 

TAC ACC CCA ATC GAT ATT TCC TTG TCG 

GAA TTT GTT CCC GGT GCT GGA TTT GTG 

TGG GGA ATT TTT GGT CCC TCT CAA TGG 
25 GAA CAG TTA ATT AAC CAA AGA ATA GAA 

ATT TCT AGA TTA GAA GGA CTA AGC AAT 

TCT TTT AGA GAG TGG GAA GCA GAT CCT 

GAG ATG CGT ATT CAA TTC AAT GAC ATG 

ATT CCT CTT TTT GCA GTT CAA AAT TAT 
30 TAT GTT CAA GCT GCA AAT TTA CAT TTA 

GTG TTT GGA CAA AGG TGG GGA TTT GAT 

TAT AAT GAT TTA ACT AGG CTT ATT GGC 

CGC TGG TAC AAT ACG GGA TTA GAA CGT 

GAT TGG GTA AGG TAT AAT CAA TTT AGA 
35 TTA GAT ATC GTT GCT CTG TTC CCG AAT 

ATT CGA ACA GTT TCC .CAA TTA ACA AGA 



TGC 


ATT 


CCT 


TAT 


AAT 


TGT 


TTA 


48 


GGA 


GAA 


AGA 


ATA 


GAA 


ACT 


GGT 


96 


CTA 


ACG 


CAA 


TTT 


CTT 


TTG 


AGT 


144 


TTA 


GGA 


CTA 


GTT 


GAT 


ATA 


ATA 


192 


GAC 


GCA 


TTT 


CTT 


GTA 


CAA_ 


_ATT 


240 


GAA 


TTC 


GCT 


AGG 


AAC 


CAA 


GCC 


288 


CTT 


TAT 


CAA 


ATT 


TAC 


GCA 


GAA _ 


336 


ACT 


AAT 


CCA 


GCA 


TTA 


AGA 


GAA 


384 


AAC 


AGT 


GCC 


CTT 


ACA 


ACC 


GCT 


432 


CAA 


GTT 


CCT 


CTT 


TTA 


TCA 


GTA 


480 


TCA 


GTT 


TTG 


AGA 


GAT 


GTT 


TCA 


528 


GCC 


GCG 


ACT 


ATC 


AAT 


AGT 


CGT 


576 


AAC 


TAT 


ACA 


GAT 


TAT 


GCT 


GTA 


524 


GTA 


TGG 


GGA 


CCG 


GAT 


TCT 


AGA 


572 


AGA 


GAA 


TTA 


ACA 


CTA 


ACT 


GTA 


720 


TAT 


GAT 


AGT 


AGA 


AGA 


TAT CCA 


768 


GAA 


ATT 


TAT 


ACA 


AAC 


CCA 


GTA 


816 
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TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GOT CAG GGC ATA GAA 
AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT AT A ACC 
ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA" 
ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ' ACT TTT CCG 
5 CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA • CAA CGT ATT A "GTT - GCT, 1056 

CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT. AGA 
AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA • TCT GT| CTT GAC 
GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 
TAG AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 
10 AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATA 
AGA GCT CCA ATG TTT TCT TGG ACG CAC CGT AGT GCA ACC CCT ACA AAT 
ACA ATT GAT CCG GAG AGG ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 
ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG TTT ACG GGA 
15 GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA TTT GCT TAT ACT ATT 

GTT AAT ATA AAT GGG CAA TTA CCC CAA AGG TAT CGT GCA AGA ATA CGC 
TAT GCC TCT ACT ACA AAT CTA AGA ATT TAC GTA ACG GTT GCA GGT GAA 
CGG ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG GAT ACC GGT GAC CCA 1680 
TTA ACA TTC CAA TCT TTT AGT TAC GCA ACT ATT AAT ACA GCT TTT ACA 1728 
20 TTC CCA ATG AGC CAG AGT AGT TTC ACA GTA GGT GCT GAT ACT TTT AGT 

TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 
GCA ACA TTT GAA GCA GAA TAT GAT TTA GAA AGA GCA CAA AAG GCG GTG 
AAT GCG CTG TTT ACT TCT ATA AAC CAA ATA GGG ATA AAA ACA GAT GTG 
ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA TCA 
25 GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA 2016 

CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC 
TTC AAA GGC ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG 
GAT ATT ACC ATC CAA AGA GGA GAT GAC GTA TTC AAA GAA AAT TAT GTC. 
ACA CTA CCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 
30 AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 

GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ATT CGC TAC 
AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA TGG 
CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT CGA 
TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 
35 GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC ATT 

GAT GTA GGA TGT ACA GAC TTA AAT GAG GAC CTA GGT GTA TGG GTG ATC 
TTT AAG ATT AAG ACG CAA GAT GGG CAC GCA AGA CTA GGG AAT CTA GAG 
TTT CTC GAA GAG AAA CCA TTA GTA GGA GAA GCG CTA GCT CGT GTG AAA 
AGA GCG GAG AAA AAA TGG AGA GAC AAA CGT GAA AAA TTG GAA TGG GAA 
40 ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT 



1104. 
1152. 
1200 
1248 
1296 
1344 
1392 
1440 
1488 
1536 
1584 
1632 



1776 
1824 
1872 
1920 
1968 



2064 
2112 
2160 
2208 
2256 
2304 
2352 
2400 
2448 
2496 
2544 
2592 
2640 
2688 
2736 
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GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GCC ATG 27 84 

ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GCT TAT CTG : 2832- 

CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT ATT TTT GAA GAA 2B80. 

TTA GAA GGG CGT ATT TTC ACT GCA TTC TCC CTA. TAT GAT GCG AGA AAT' ' 2928 

GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG AAC GTG 297:6 

AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 3024 

GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA. GTT CGT GTC TGT 3072 

CCG GGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT 3120 

GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA 3168 

CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 3 216 

GTA ACG TGT AAT GAT TAT ACT GTA AAT' CAA GAA GAA TAC GGA GGT GCG 3264 

TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CCA GCT 3312 

GAT TAT GCG TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA 3360 

GAG AAT CCT TGT GAA TTT AAC AGA GGG TAT AGG GAT TAC ACG CCA CTA . 3408 

CCA GTT GGT TAT GTG ACA AAA GAA TTA GAA TAC TTC CCA GAA ACC GAT 3456 

AAG GTA TGG ATT GAG ATT GGA GAA ACG GAA GGA ACA TTT ATC GTG GAC 3 5 04 
AGC GTG GAA TTA CTC CTT ATG GAG GAA 



3531 



6.6.2 DNA SEQUENCE ENCODING THE EG11074 CRYSTAL PROTEIN (SEQ ID NO:ll) 


ATG 


GAT 


AAC 


AAT 


CCG 


AAC 


ATC 


AAT 


GAA 


TGC 


ATT 


CCT 


TAT 


AAT 


TGT 


TTA 


48 


AGT 


AAC 


CCT 


GAA 


GTA 


GAA 


GTA 


TTA 


GGT 


GGA 


GAA 


AGA 


ATA 


GAA 


ACT 


GGT 


96 


TAC 


ACC 


CCA 


ATC 


GAT 


ATT 


TCC 


TTG 


TCG 


CTA 


ACG 


CAA 


TTT 


CTT 


TTG 


AGT 


144 


GAA 


TTT 


GTT 


CCC 


GGT 


GCT 


GGA 


TTT 


GTG 


TTA 


GGA 


CTA 


GTT 


GAT 


ATA 


ATA 


192 


TGG 


GGA 


ATT 


TTT 


GGT 


CCC 


TCT 


CAA 


TGG 


GAC 


GCA 


TTT 


CTT 


GTA 


CAA 


ATT 


240 


GAA 


CAG 


TTA 


ATT 


AAC 


CAA 


AGA 


ATA 


GAA 


GAA 


TTC 


GCT 


AGG 


AAC 


CAA 


GCC 


288 


ATT 


TCT 


AGA 


TTA 


GAA 


GGA 


CTA 


AGC 


AAT 


CTT 


TAT 


CAA 


ATT 


TAC 


GCA 


GAA 


336 


TCT 


TTT 


AGA 


GAG 


TGG 


GAA 


GCA 


GAT 


CCT 


ACT 


AAT 


CCA 


GCA 


TTA 


AGA 


GAA 


384 


GAG 


ATG 


CGT 


ATT 


CAA 


TTC 


AAT 


GAC 


ATG 


AAC 


AGT 


GCC 


CTT 


ACA AC£_GCT 


432 


ATT 


CCT 


CTT 


Td?T GCA 


GTT 


CAA 


AAT 


TAT 


CAA 


GTT 


CCT 


CTT 


TTA 


TCA. 


GTA 


480 


TAT 


GTT 


CAA 


GCT 


GCA 


AAT 


TTA 


CAT 


TTA 


TCA 


GTT 


TTG 


AGA GAT GTT 


TCA_ 


528 


GTG 


TTT 


GGA 


CAA 


AGG 


TGG 


GGA 


TTT 


GAT 


GCC 


GCG 


ACT 


ATC 


AAT 


AGT 


CGT 


576 


TAT 


AAT 


GAT 


TTA 


ACT 


AGG 


CTT 


ATT 


GGC 


AAC 


TAT 


ACA 


GAT 


TAT 


GCT 


GTA 


624 


CGC 


TGG 


TAC 


AAT 


ACG 


GGA 


TTA 


GAA 


CGT 


GTA 


TGG 


GGA 


CCG 


GAT 


TCT 


AGA 


„ 672 


GAT 


TGG 


GTA 


AGG 


TAT 


AAT 


CAA 


TTT 


AGA 


AGA 


GAA 


TTA 


ACA 


CTA 


ACT 


GTA 


• .720 


TTA 


GAT 


ATC 


GTT 


GCT 


CTG 


TTC 


CCG 


AAT 


TAT 


GAT 


AGT 


AGA 


AGA 


TAT 


CCA 


768 


ATT 


CGA 


ACA 


GTT 


TCC 


CAA 


TTA 


ACA 


AGA 


GAA 


ATT 


TAT 


ACA 


AAC 


CCA 


GTA 


816 


TTA 


GAA 


AAT 


TTT 


GAT 


GGT 


AGT 


TTT 


CGA 


GGC 


TCG 


GCT 


CAG 


GGC 


ATA 


GAA 


864 


AGA 


AGT 


ATT 


AGG 


AGT 


CCA 


CAT 


TTG 


ATG 


GAT 


ATA 


CTT 


AAC 


AGT 


ATA 


ACC 


912 


ATC 


TAT 


ACG 


GAT 


GCT 


CAT 


AGG 


GGT 


TAT 


TAT 


TAT 


TGG 


TCA 


GGG 


CAT. CAA 


960 


ATA 


ATG 


GCT 


TCT 


CCT 


GTA 


GGG 


TTT 


TCG 


GGG 


CCA 


GAA 


TTC 


ACT 


TTT 


CCG 


1008 
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CTA TAT GGA ACT ATG GGA AAT GCA 
CAA CTA GGT CAG GGC GTG TAT AGA 
AGA CCT TTT AAT ATA GGG ATA AAT 
GGG ACA GAA TTT GCT TAT GGA ACC 
5 TAC AGA AAA AGC GGA ACG GTA^ GAT 

AAT AAC AAC GTG CCA CCT AGG CAA 
GTT TCA ATG TTT CGT TCA GGC TTT 
AGA GCT CCA ATG TTT TCT TGG ACG 
ACA ATT GAT CCG GAG AGG ATT ACT 
10 ACA CTT CAG TCA GGT ACT ACT GTT 

GGA GAT ATT CTT CGA CGA ACA AGT. 
GTT AAT ATA AAT GGG CAA TTA CCC 
TAT GCC TCT ACT ACA AAT CTA AGA 
CGG ATT TTT GCT GGT CAA TTT AAC 
15 TTA ACA TTC CAA TCT TTT AGT TAC 

TTC CCA ATG AGC CAG AGT AGT TTC 
TCA GGG AAT GAA GTT TAT ATA GAC 
GCA ACA CTC GAG GCT GAA TAT AAT 
AAT GCG CTG TTT ACG TCT ACA AAC 
20 ACG GAT TAT CAT ATT GAT CAA GTG 

GAT GAA TTT TGT CTG GAT GAA AAG 
CAT GCG AAG CGA CTC AGT GAT GAA 
TTC AAA GAC ATT AAT AGG CAA CCA 
GGG ATT ACC ATC CAA GGA GGG GAT 
25 ACA CTA TCA GGT ACC TTT GAT GAG 

AAA ATC GAT GAA TCA AAA TTA AAA 
GGG TAT ATC GAA GAT AGT CAA GAC 
AAT GCA AAA CAT GAA ACA GTA AAT 
CCG CTT TCA GCC CAA AGT CCA ATC 
30 TGC GCG CCA CAfc CTT GAA TGG AAT 

GAT GGA GAA AAG TGT GCC CAT CAT 
GAT GTA GGA TGT ACA GAC TTA AAT 
TTT AAG ATT AAG ACG CAA GAT GGG 
TTT CTC GAA GAG AAA CCA TTA GTA 
35 AGA GCG GAG AAA AAA TGG AGA GAC 

ACA AAT ATC GTT TAT . AAA GAG GCA 
GTA AAC TCT CAA TAT GAT CAA TTA 
ATT CAT GCG GCA GAT AAA CGT GTT 
CCT GAG CTG TCT GTG ATT CCG GGT 
40 TTA GAA GGG CGT ATT TTC ACT GCA 



GCT CCA CAA CAA CGT ATT GTT GQT 1056 

ACA TTA TCG TCC ACT TTA TAT AGA 1104 

AAT CAA CAA CTA TCT GTT CTT GAC 11,52 

TCC TCA AAT TTG CCA TCC .GCT . GTA 'X%0ti 

TCG CTG GAT GAA ATA CCG QCh O^G 12.4T8 
GGA TTT AGT CAT CGA TTA AGC CAT 
AGT AAT AGT AGT GTA AGT. ATA ATA /; 

CAC CGT AGT GCA ACC CCT ACA AAT 13 9:2 

CAA ATA CCA TTG GTA AAA GCA CAT 1440 

GTA AGA GGG CCC GGG TTT ACG GGA 1488 

GGA GGA CCA TTT GCT TAT ACT ATT 153 6 

CAA AGG TAT CGT GCA AGA ATA CGC 1584 

ATT TAC GTA ACG GTT GCA GGT GAA 1632 

AAA ACA ATG GAT ACC GGT GAC CCA 1680 

GCA ACT ATT AAT ACA GCT TTT ACA 1728 

ACA GTA GGT GCT GAT ACT TTT AGT 1776 

AGA TTT GAA TTG ATT CCA GTT ACT 1824 

CTG GAA AGA GCG CAG AAG GCG GTG 187 2 

CAA CTA GGG CTA AAA ACA AAT GTA 1920 

TCC AAT TTA GTT ACG TAT TTA TCG 1968 

CGA GAA TTG TCC GAG AAA GTC AAA 2016 

CGC -AAT TTA CTC CAA GAT TCA AAT 2064 

GAA CGT GGG TGG GGC GGA AGT ACA 2112 

GAC GTA TTT AAA GAA AAT TAC GTC 2160 

TGC TAT CCA ACA TAT TTG TAT CAA 220 8 

GCC TTT ACC CGT TAT CAA TTA AGA 2256 

TTA GAA ATC TAT TTA ATT CGC TAC 23 04 

GTG CCA GGT ACG GGT TCC TTA TGG 2 3 52 

GGA AAG TGT GGA GAG CCG AAT" CGA 2400 

CCT GAC TTA GAT TGT TCG TGT AGG 2448 

TCG CAT CAT TTC TCC TTA GAC ATT* 2496 

GAG GAC CTA GGT GTA TGG GTG* ATC 2544 

CAC GCA AGA CTA GGG AAT CTA GAG 2592 

GGA GAA GCG CTA GCT CGT GTG AAA 2640 

AAA CGT GAA AAA TTG GAA TGG GAA 2688 

AAA GAA TCT GTA GAT GCT TTA TTT 273 6 

CAA GCG GAT ACG AAT ATT GCC ATG 27 84 

CAT AGC ATT CGA . GAA GCT TAT CTG 2 83 2 

GTC AAT GCG GCT ATT TTT GAA GAA 28 80 

TTC TCC CTA TAT GAT GCG AGA: AAT 29 28 
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GTC ATT AAA AAT GGT GAT TTT AAT AAT .GGC TTA TCC TGC TGG AAC GTG 297,6 

AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 3024 

GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT .CGT GTC TGT 3072' 

CCG GGT CGT GGC TAT ATC CTT- CGT GTC ACA ; GC(3 TAC AA^' f AG->G^ ; TAT i ^f^',. 

•GGA GAA GGT -TGC GTA ACC ATT CAT GAG ; P TC GAG AAC AAT ACA^GAC GAA- 3 16 8-, - 

CTG AAG TTT AGC AAC TGC GTA GAA GAG .GAA ATC TAT CCA AAT. AAC ACG - ■ 

GTA ACG TGT AAT GAT TAT ACT GTA -AAT . CAA GAA GAA TAC GGA -GGT' GCG . 326| . ; 

TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CCA GCT 3 3 12 



GAT 


TAT 


GCG 


TCA 


GTC 


TAT 


GAA 


GAA 


AAA 


TCG 


TAT 


ACA 


GAT 


GGA 


CGA 


AGA 


3360 


GAG 


AAT 


CCT 


TGT 


GAA 


TTT 


AAC 


AGA 


GGG 


TAT 


AGG 


GAT 


TAC 


ACG 


CCA CTA 


3408 


CCA 


GTT 


GGT 


TAT 


GTG 


ACA 


AAA GAA TTA GAA TAC TTC CCA GAA ACC 


GAT' 


3456 


AAG 


GTA 


TGG 


ATT 


GAG 


ATT 


GGA 


GAA 


ACG 


GAA 


GGA 


ACA 


TTT 


ATC 


GTG 


GAC 


3504 


AGC 


GTG 


GAA 


TTA 


CTC 


CTT 


ATG 


GAG 


GAA 
















3531 


6.6.3 DNA Sequence Encoding the EG11735 Crystal Protein (S: 


EQIDNO:12 


ATG 


GAT 


AAC 


AAT 


CCG 


AAC 


ATC 


AAT 


GAA 


TGC 


ATT 


CCT 


TAT 


AAT 


TGT 


TTA 


A Q 


AGT 


AAC 


CCT 


GAA 


GTA 


GAA 


GTA 


TTA 


GGT 


GGA 


GAA 


AGA 


ATA 


GAA 


ACT 


GGT 




TAC 


ACC 


CCA 


ATC 


GAT 


ATT 


TCC 


TTG 


TCG 


CTA 


ACG 


CAA 


TTT 


CTT 


TTG 


AGT 


1 A A 


GAA 


TTT 


GTT 


CCC 


GGT 


GCT 


GGA 


TTT 


GTG 


TTA 


GGA 


CTA 


GTT 


GAT 


ATA 


ATA 


1 Q9 

J. Z7 Z 


TGG 


GGA 


ATT 


TTT 


GGT 


CCC 


TCT 


CAA 


TGG 


GAC 


GCA 


TTT 


CTT 


GTA 


CAA ATT 


9 A 0 


GAA 


CAG 


TTA 


ATT 


AAC 


CAA 


AGA 


ATA 


GAA 


GAA 


TTC 


GCT 


AGG 


AAC 


CAA 


GCC 


288 


ATT 


TCT 


AGA 


TTA 


GAA 


GGA 


CTA 


AGC 


AAT 


CTT 


TAT 


CAA 


ATT 


TAC 


GCA 


GAA 


336 


TCT 


TTT 


AGA 


GAG 


TGG 


GAA 


GCA 


GAT 


CCT 


ACT 


AAT 


CCA 


GCA 


TTA 


AGA 


GAA 


384 


GAG 


ATG 


CGT 


ATT 


CAA 


TTC 


AAT 


GAC 


ATG 


AAC 


AGT 


GCC 


CTT 


ACA 


ACC 


GCT 


432 


ATT 


CCT 


CTT 


TTT 


GCA 


GTT 


CAA 


AAT 


TAT 


CAA 


GTT 


CCT 


CTT 


TTA 


TCA 


GTA 


480 


TAT 


GTT 


CAA 


GCT 


GCA 


AAT 


TTA 


CAT 


TTA 


TCA 


GTT 


TTG 


AGA 


GAT 


GTT 


TCA 


528 


GTG 


TTT 


GGA 


CAA 


AGG 


TGG 


GGA 


TTT 


GAT 


GCC 


GCG 


ACT 


ATC 


AAT 


AGT 


CGT 


576 


TAT 


AAT 


GAT 


TTA 


ACT 


AGG 


CTT 


ATT 


GGC 


AAC 


TAT 


ACA 


GAT 


CAT 


GCT 


GTA 


624 


CGC 


TGG 


TAC 


AAT 


ACG 


GGA 


TTA 


GAG 


CGT 


GTA 


TGG 


GGA 


CCG 


GAT 


TCT 


AGA 


672 


GAT 


TGG 


ATA 


AGA 


TAT 


AAT 


CAA 


TTT 


AGA 


AGA 


GAA 


TTA 


ACA 


CTA 


ACT 


GTA 


720 


TTA 


GAT 


ATC 


GTT 


TCT 


CTA 


TTT 


CCG 


AAC 


TAT 


GAT 


AGT 


AGA 


ACG 


TAT 


CCA 


768 


ATT 


CGA 


ACA 


GTT 


TCC 


CAA 


TTA 


ACA 


AGA 


GAA 


ATT 


TAT 


ACA 


AAC 


CCA 


GTA 


816 


TTA 


GAA 


AAT 


TTT 


GAT 


GGT 


AGT 


TTT 


CGA 


GGC 


TCG 


GCT 


CAG 


GGC 


ATA 


GAA 


864 


GGA 


AGT 


ATT 


AGG 


AGT 


CCA 


CAT 


TTG 


ATG 


GAT 


ATA 


CTT. 


- AAC 


AGT 


ATA 


ACC 


912 


ATC 


TAT 


ACG 


GAT 


GCT 


CAT 


AGA 


GGA 


GAA 


TAT 


TAT 


TGG 


TCA 


GGG 


CAT 


CAA 


960 


ATA 


ATG 


GCT 


TCT 


CCT 


GTA 


GGG 


TTT 


TCG 


GGG 


CCA 


GAA 


TTC 


ACT 


TTT 


CCG 


1008 


CTA 


TAT 


GGA 


ACT 


ATG 


GGA 


AAT 


GCA 


GCT 


CCA 


CAA 


CAA 


CGT 


ATT 


GTT 


GCT 


1056 


CAA 


CTA 


GGT 


CAG 


GGC 


GTG 


TAT 


AGA 


ACA 


TTA 


TCG 


TCC 


ACT 


TTA 


TAT 


AGA 


1104 


AGA 


CCT 


TTT 


AAT 


ATA 


GGG 


ATA 


AAT 


AAT 


CAA 


CAA 


CTA 


TCT 


GTT 


CTT 


GAC 


1152 


GGG 


ACA 


GAA 


TTT 


GCT 


TAT 


GGA 


ACC 


TCC 


TCA 


AAT 


TTG 


CCA 


TCC 


GCT 


GTA 


1200 
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TAC AGA AAA AGC GGA ACG GTA GAT TCG- CTG GAT. GAA ATA CGG CCA CAG 12.48 

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT GGA TTA AGC CAT 1296 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATA 1344 

AGA GCT CCA ATG TTT TCT TGG ACG CAC CGT AGT. GCA ACC CCTAC^ AAT • 1393 

5 ACA ATT GAT CCG GAG AGG ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 1440 

ACA CTT CAG TCA GGT ACT ACT- .GTT GTA AGA GGG CCC GGG TTT ACG GGA. 14£I8 





GGA 


GAT 


ATT 


< 

V J. I 


CGA CGA ACA AGT GGA GGA GCA 


TTT GCT TAT ; ACT* AyT 


;153 ; 6 




GTT 


AAT 


ATA 


AAT 


GGG 


CAA 


TTA 


CCC 


CAA 


AGO 


1 Al 




Pp A 


aha 


ATA 


CGC 


1584 




TAT 


GCC 


TCT 


ACT 


ACA 


AAT 


CTA 


AGA 


ATT 


TAC 


GTA 


A PP 


prnm 
Oil 


PP A 


ncT 


GAA 


1632 


10 


CGG 


ATT 


TTT 


GCT 


GGT 


CAA 


TTT 


AAC 


AAA 


ACA 


ATG 


GAT 


A PP 




PAP 


PPA 


1680 




TTA 


ACA 


TTC 


CAA 


TCT 


TTT 


AGT 


TAC 


GCA 


ACT 


ATT 


AAT 


ACA 


ppm 


TTT 


APA 


1728 




TTC 


CCA 


ATG 


AGC 


CAG 


AGT 


AGT 


TTC 


ACA 


GTA 


GGT 


GCT 


GAT 


ACT 


TTT 




1776 




TCA 


GGG 


AAT 


GAA 


GTT 


TAT ATA GAC 


AGA 


TTT 


GAA 


TTG 


ATT 


LLA 




APT 


1824 




GCA 


ACA 


TTT 


GAA 


GCA 


GAA 


TAT 


GAT 


TTA 


GAA 


AGA 


GCA 


CAA 


AAG 


PPP 


pmp 


1872 


15 


AAT 


GCG 


CTG 


TTT 


ACT 


TCT 


ATA 


AAC 


CAA 


ATA 


GGG 


ATA 


AAA 


ACA 


PAT 




.1920 




ACG 


GAT 


TAT 


CAT 


ATT 


GAT 


CAA 


GTA 


TCC 


AAT 


TTA 


GTG 


GAT 


TGT 


rprri.A 
1 1A 




19 68 




GAT 


GAA 


TTT 


TGT 


CTG 


GAT 


GAA 


AAG 


CGA 


GAA 


TTG 


TCC 


GAG 


AAA 


GTC 


AAA . 


2016 




CAT 


GCG 


AAG 


CGA 


CTC 


AGT 


GAT 


GAG 


CGG 


AAT 


TTA 


CTT 


CAA 


GA1 


CCA 


AAC 


2064 




TTC 


AAA 


GGC 


ATC 


AAT 


AGG 


CAA 


CTA 


GAC 


CGT 


GGT 


TGG 


AGA 


GGA 


AGT 


ACG 


2112 


20 


GAT 


ATT 


ACC 


ATC 


CAA 


AGA 


GGA 


GAT 


GAC 


GTA 


TTC 


AAA 


GAA 


AAT 


TAT 


GTC 


2160 




ACA 


CTA 


CCA 


GGT 


ACC 


TTT 


GAT 


GAG 


TGC 


TAT 


CCA 


ACA 


TAT 


TTG 


TAT 


CAA 


2208 




AAA 


ATC 


GAT 


GAA 


TCA 


AAA 


TTA 


AAA 


GCC 


TTT 


ACC 


CGT 


TAT 


p a a 
CAA 


TTA 


AGA 


2256 




GGG 


TAT 


ATC 


GAA 


GAT 


AGT 


CAA 


GAC 


TTA 


GAA 


ATC 


TAT 


TTA 


A rpm 


CGC 


TAC 


2304 




AAT 


GCA 


AAA 


CAT 


GAA 


ACA 


GTA 


AAT 


GTG 


CCA 


GGT 


ACG 


pprp 


mpp 


TTA 


TGG 


2352 


25 


CCG 


CTT 


TCA 


GCC 


CAA 


AGT 


CCA 


ATC 


GGA 


AAG 


TGT 


GGA 


PAP 


ppP 


AAT 


CGA 


2400 




TGC 


GCG 


CCA 


CAC 


CTT 


GAA 


TGG 


AAT 


CCT 


GAC 


TTA 


GAT 


mprp 
i Vjr 1 


TPP. 


TGT 


AGG 


2448 




GAT 


GGA 


GAA 


AAG 


TGT 


GCC 


CAT 


CAT 


TCG 


CAT 


CAT 




rppp 


TTA 
i in 


GAC 


ATT 


2496 




GAT 


GTA 


GGA 


TGT 


ACA 


GAC 


TTA 


AAT 






L In 


GGT 


GTA 


TGG 


GTG 


ATC 


2544 




TTT 


AAG 


ATT 


A A C 


ACG 


CAA 


GAT 


GGG 


CAC 


GCA 


AGA 


CTA 


GGG 


AAT 


CT7T GAG 


2592 


30 


TTT 


CTC 


GAA 


GAG 


AAA 


CCA 


TTA 


GTA 


GGA 


GAA 


GCG 


CTA 


GCT 


CGT 


GTG 


AAA 


2640 




AGA 


GCG 


GAG 


AAA 


AAA 


TGG 


AGA 


GAC 


AAA 


CGT 


GAA 


AAA 


TTG 


GAA 


TGG 


GAA" 


2688 




ACA 


AAT 


ATC 


GTT 


TAT 


AAA 


GAG 


GCA 


AAA 


GAA 


TCT 


GTA 


GAT 


GCT 


TTA 


TTT 


2736 




GTA 


AAC 


TCT 


CAA 


TAT 


GAT 


CAA 


TTA 


CAA 


GCG 


GAT 


ACG 


AAT 


ATT 


GCC 


ATG 


2784 




ATT 


CAT 


GCG 


GCA 


GAT 


AAA 


CGT 


GTT 


CAT 


AGC 


ATT 


CGA 


GAA 


GCT 


TAT 


CTG 


2832 


35 


CCT 


GAG 


CTG 


TCT 


GTG 


ATT 


CCG 


GGT 


GTC 


AAT 


GCG 


GCT 


ATT 


TTT 


GAA 


GAA 


2880 




TTA 


GAA 


GGG 


CGT 


ATT 


TTC 


ACT 


GCA 


TTC 


TCC 


CTA 


TAT 


GAT 


GCG 


AGA 


AAT 


2928 




GTC 


ATT 


AAA 


AAT 


GGT 


GAT 


TTT 


AAT 


AAT 


GGC 


TTA 


TCC 


TGC 


TGG 


AAC 


GTG 


2976 




AAA 


GGG 


CAT 


GTA 


GAT 


GTA 


GAA 


GAA 


CAA 


AAC 


AAC 


CAA 


* CGT 


TCG 


GTC 


CTT 


3024 




GTT 


GTT 


CCG 


GAA 


TGG 


GAA 


GCA 


GAA 


GTG 


TCA 


CAA 


GAA 


GTT 


CGT 


GTC 


TGT 


3072 


40 


CCG 


GGT 


CGT 


GGC 


TAT 


ATC 


CTT 


CGT 


GTC 


ACA 


GCG 


TAC 


AAG 


GAG 


GGK 


TAT 


3120 
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GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA, GAC GAA 
CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 
GTA ACG TGT AAT GAT TAT ACT GTA AAT CAA GAA GAA TAG GGA GGT GCG 
TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CC^ GGT 
GAT TAT GCG TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA 
GAG AAT CCT TGT GAA TTT AAC AGA GGG TAT AGG GAT TAC ACG CCA CTA 
CCA GTT GGT TAT GTG ACA AAA GAA, TTA GAA TAC TTC CCA GAA ACC GAT 
AAG GTA TGG ATT GAG ATT GGA GAA ACG GAA GGA ACA TTT ATC GTG GAC 
AGC GTG GAA TTA CTC CTT ATG GAG GAA 



31 = 
321 

32 z 
331 

•32 £ 

34: 

34; 
3= : 
352 



6.6.4 DNA Sequence Encoding the EG11092 Crystal Protein (SEQ ID NO 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 
AGT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 

TAC ACC CCA ATC GAT ATT TCC TTG TCG CTA ACG CAA TTT CTT TTG AGT - 
GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GTT GAT ATA ATA 

TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 1 

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 1 
ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT TAT CAA ATT TAC GCA GAA 

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AGA GAA 2 

GAG ATG CGT ATT CAA TTC AAT GAC ATG AAC AGT GCC CTT ACA ACC GCT 4 

ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT CTT TTA TCA GTA 4 
TAT GTT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 
GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 
TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT CAT GCT GTA 
CGC TGG TAC AAT ACG GGA TTA GAG. CGT GTA TGG GGA CCG GAT TCT AGA 
GAT TGG ATA AGA TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 
TTA GAT ATC GTT TCT CTA TTT CCG AAC TAT GAT AGT AGA ACG TAT CCA 

ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA. GTA £ 

TTA GAA" AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA . GAA £ 

AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC. 9 

ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 5 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 1Z 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT II 

CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 11 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 11 

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 12 

TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 12 

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 12 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT AIA ATA 12 

AGA GCT CCA ATG TTT TCT TGG ACG CAC CGT AGT GCA ACC CCT ACA AAT 12 
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1440 
1488 
1536 

1S84; 



ACA ATT GAT CCG GAG AGG ATT ACT GAA ATA CCA TTG GTA AAA GCA CAT 
ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG T11T ACG GGA 
GGA GAT ATT CTT CGA CGA AGA AGT GGA GGA CCA T^T;^ AQ^ 
GTT AAT ATA AAT GGG CAA TTA CCC GAA AGG TAT CGT GCA^W^. 
TAT GCC TCT ACT ACA AAT CTA AGA ATT TAC GTA ACG ^ GTT Q CA GGT .GAA 
CGG ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG; GAT ACC: GGT ;G^C C£A 
TTA ACA TTC CAA TCT TTT AGT TAC GCA ACT ATT AAT ACA GCT TTT ^ AC A" 
TTC CCA ATG AGC CAG AGT AGT TTC ACA GTA GGT GCT GAT ACT TTT AGT 1776 
TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 
GCA ACA TTT GAA GCA GAA TAT GAT TTA GAA AGA GCA CAA AAG GCG GTG 
AAT GCG CTG TTT ACT TCT ATA AAC CAA ATA GGG ATA AAA ACA GAT GTG 
ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA TCA 
GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA 
CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC 
TTC AAA GGC ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG 
GAT ATT ACC ATC CAA AGA GGA GAT GAC. GTA TTC AAA GAA AAT TAT GTC 
ACA CTA CCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 
AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 
GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ATT CGC TAC 
AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA TGG 
CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT CGA 
TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 
GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC ATT 
GAT GTA GGA TGT ACA GAC TTA AAT GAG GAC CTA GGT GTA TGG GTG ATC 
TTT AAG ATT AAG ACG CAA GAT GGG CAC GCA AGA CTA GGG AAT CTA GAG 
TTT CTC GAA GAG AAA CCA TTA GTA GGA GAA GCG CTA GCT CGT GTG AAA 
AGA GCG GAG AAA AAA TGG AGA GAC AAA CGT GAA AAA TTG GAA TGG GAA 
ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT. GCT TTA TTT 
GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GG<=- ATG 
ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GGT TAT- -CTG 
CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT ATT TTT GAA GAA- 
TTA GAA GGG CGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA ' AAT 
GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG AAC GTG 
AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 
GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT CGT GTC TGT 
CCG GGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT 
GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA 
CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT . CCA AAT AAC ACG 
GTA ACG TGT AAT GAT TAT ACT GTA AAT CAA GAA GAA TAC GGA GGT GCG 
TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CG*r GCT 



11:7^8 



1824 
1872 
192 0 
1968 
2016 
2064 
2112 
2160 
2208 
2256 
2304 
2352 
2400 
2448 
2496 
2544 
2592 
2640 
2688 
2736 
2784 
2832 
2 8 80 
2928 
2976 
3024 
3072 
3120 
3168 
3216 
3264 
3312 
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GAT TAT GCG TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA 
GAG AAT CCT TGT GAA TTT AAC AGA GGG TAT AGG GAT TAG .ACG CCA CTA 
CCA GTT GGT TAT GTG ACA AAA GAA TTA GAA TAC TTC CCA GAA ACC GAT 
AAG GTA TGG ATT GAG ATT GGA GAA ACG GAA GGA ACA TCT ^TC GTG GAG 
AGC GTG GAA TTA CTC CTT ATG GAG GAA TAG 



3360 

34{oa 

3456 
3534 



6 6 5 DNA Sequence Encoding the EG11751 Crystal Protein (SEQ ID NO:27 , 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 48 
AGT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 96. 

TAC ACC CCA ATC GAT ATT - TCC TTG TCG. CTA ACG CAA TTT CTT TTG AGT 144 

GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GTT GAT ATA ATA 192 

TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 240 

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 288 

ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT TAT CAA ATT TAC GCA GAA 336 

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AGA GAA 3 84 

GAG ATG CGT ATT CAA TTC AAT GAC ATG AAC AGT GCC CTT ACA ACC GCT 432 

ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT CTT TTA TCA GTA 480 

TAT GTT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 528 

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 57 6 
TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT TAT GCT GTA 624 
CGC TGG TAC AAT ACG GGA TTA GAA CGT GTA TGG GGA CCG GAT TCT AGA 672 
GAT TGG GTA AGG TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 720 
TTA GAT ATC GTT GCT CTG TTC CCG AAT TAT GAT AGT AGA AGA TAT CCA 768 
ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 816 
TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 864 
AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 912 
ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 960 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 1008 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT. GCT 1056 

CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 1104 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 1152 

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 1200 

TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 1248 

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 1296 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATA 1344 

AGA GCT CCT ATG TTC TCT TGG ATA CAT CGT AGT GCT GAA TTT AAT AAT 1392 

ATA ATT GCA TCG GAT AGT ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 1440 

ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG TTT ACG GGA 1488 

GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA TTT GCT TAT ACT ATT 1536 

GTT AAT ATA AAT GGG CAA TTA CCC CAA AGG TAT CGT GCA AGA ATA CGC 1584 
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TAT GCC TCT ACT ACA AAT CTA AGA ATT TAG GTA ACG GTT GCA GGT GAA 
CGG ; ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG GAT ACC GGT GAC CCA 
TTA ACA TTC CAA TCT TTT ACT TAC GCA ACT ATT AAT ACA GCT TTT ACA 
TTC CCA ATG AGC GAG AGT AGT -TTC ACA GTA GGT GCT GAT ACT TTT AGT-, 
TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 
GCA ACA TTT GAA GCA GAA TAT GAT TTA GAA AGA GCA CAA AAG GCG GTG 
AAT GCG CTG TTT ACT TCT ATA AAC CAA ATA GGG ATA. AAA ACA GAT GTG 
ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA TCA 
GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA 
CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA "GAT CCA AAC 
TTC AAA GGC ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG 
GAT ATT ACC ATC CAA AGA GGA GAT GAC GTA TTC AAA GAA AAT TAT GTC 
ACA CTA CCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 
AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 
CGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ATT CGC TAC 
AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA TGG 
CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT CGA 
TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 
GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC ATT 
GAT GTA GGA TGT ACA GAC TTA AAT GAG GAC CTA GGT GTA TGG GTG ATC 
TTT AAG ATT AAG ACG CAA GAT GGG CAC GCA AGA CTA GGG AAT CTA GAG 
TTT CTC GAA GAG AAA CCA TTA GTA GGA GAA GCG CTA GCT CGT GTG AAA 
AGA GCG GAG AAA AAA TGG AGA GAC AAA CGT GAA AAA TTG GAA TGG GAA 
ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT 
GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GCC ATG 
ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GCT TAT CTG 
CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT ATT TTT GAA GAA 
TTA GAA GGG GGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA AAT 
GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG ASC GTG 
AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 
GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT CGT GTC TGT 
CCG GGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT 
GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA 
CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 
GTA ACG TGT AAT GAT TAT ACT GTA AAT CAA GAA GAA TAC GGA GGT GCG 
TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CCA GCT 
GAT TAT GCG TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA 
GAG AAT CCT TGT GAA TTT AAC AGA GGG TAT AGG GAT TAC ACG CCA CTA 
CCA GTT GGT TAT GTG ACA AAA GAA TTA GAA TAC TTC CCA GAA ACC GAT 
AAG GTA TGG ATT GAG ATT GGA GAA ACG GAA GGA ACA TTT ATC GTG GAC 
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AGC GTG GAA TTA CTC CTT ATG GAG GAA TAG 

■ * i. ■ 

6.6.6 DNA Sequence Encoding the EG11090 Crystal Protein (SEQ ip 1#29) 

ATG GAT AAC AAT CCG ,AAC ATC AAT GAA TGC ATT CGT TAT AAT; T^T. T^A. ; , .. & / 

AGT AAG CGT GAA GTA GAA GTA TTA GGT GGA GAA AGA AT A' GAA ACT GGT 96, 

TAG ACC CCA ATC GAT ATT TCC TTG TCG CTA AcG '6aA TT^. CTT CTij'AGT/ ,^4- 

GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA s GTT GAT ATA ATA 792 " 

TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 240 

GAA GAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG .AAC CAA GCC 288 

ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT. TAT CAA ATT- TAC GCA- GAA 336 

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AGA GAA 384 

GAG ATG CGT ATT CAA TTC AAT GAC ATG AAC AGT GCC CTT ACA ACC GCT 432 

ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT CTT TTA TCA GTA 480 

TAT GTT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 528 

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 576 

TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT TAT GCT GTA 624 

CGC TGG TAC AAT ACG GGA TTA GAA CGT GTA TGG GGA CCG GAT TCT AGA 672 

GAT TGG GTA AGG TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 720 

TTA GAT ATC GTT GCT CTG TTC CCG AAT TAT GAT AGT AGA AGA TAT. CCA 768 

ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 816 

TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 864 

AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 912 

ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 960 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 1008 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT 1056 

CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 1104 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 1152 

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 1200 

TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA. CAG 1248 

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 1296 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATA 1344 

AGA GCT CCT ATG TTC TCT TGG ATA CAT CGT AGT GCA ACT CTT ACA AAT 1392 

ACA ATT GAT CCA GAG AGA ATT AAT CAA ATA CCT TTA GTG AAA GGA TTT 1440 

AGA GTT TGG GGG GGC ACC TCT GTC ATT ACA GGA CCA GGA TTT ACA GGA 1488 

GGG GAT ATC CTT CGA AGA AAT ACC TTT GGT GAT TTT GTA TCT CTA CAA 1536 

GTC AAT ATT AAT TCA CCA ATT ACC CAA AGA TAC CGT TTA AGA TTT CGT 1584 

TAC GCT TCC AGT AGG GAT GCA CGA GTT ATA GTA TTA ACA GGA GCG GCA 1632 

TCC ACA GGA GTG GGA GGC CAA GTT AGT GTA AAT ATG CCT CTT CAG AAA 1680 

ACT ATG GAA ATA GGG GAG AAC TTA ACA TCT AGA ACA TTT AGA TAJ ACC 1728 

GAT TTT AGT AAT CCT TTT TCA TTT AGA GCT AAT CCA GAT ATA ATT GGG 1776 
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ATA ACT GAA CAA CCT CTA TTT GGT GCA GGT TCT ATT AGT AGC GGT GA^ ^ 

CTT TAT ATA GAT AAA ATT GAA ATT ATT CTA GCA GAT GCA ACA TTT GAA ,872 

GCA GAA TCT GAT TTA GAA AGA GCA CAA AAG GCG GTG CTG TTT ^ ; 

ACT TCT TCC AAT CAA ATC GGG TTA AAA ACC GAT GTG7 ACa GAT TAT ; CAT 196^. 

ATT GAT CAA GTA TCC AAT TTA GTG GAT % " # Tg| . - «* ~ | 

CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA CAT GCG AAG CGA ; g |, 

CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC. TTC AGA GGG ATC : 2112 
AAT AGA CAA CCA GAC CGT GGC TGG AGA GGA AGT ACA GAT ATT ACC ATC 
CAA GGA GGA GAT GAC GTA TTC AAA GAG AAT TAC GTC ACA CTA CCG GGT 

ACC GTT GAT GAG TGC TAT CCA ACG TAT TTA TAT CAG AAA -ATA GAT GAG .. 2 

TCG AAA TTA AAA GCT TAT ACC CGT TAT GAA TTA AGA GGG TAT ATC GAA 304 

GAT AGT CAA GAC TTA GAA ATC TAT TTG ATC CGT TAC AAT GCA AAA CAC 2352 
Z ATA GTA AAT GTG CCA GGC ACG GGT TCC TTA TGG CCG CTT TCA GCC 
CAA AGT CCA ATC -GGA AAG TGT GGA GAA CCG AAT CGA TGC GCG CCA CAC 

CTT GAA TGG AAT CCT GAT CTA GAT TGT TCC TGC AGA GAC GGG GAA AAA 2 .4 6 
TGT GCA CAT CAT TCC CAT CAT TTC ACC TTG GAT ATT GAT GTT GGA TGT 

ACA GAC TTA AAT GAG GAC TTA GGT GTA TGG GTG ATA TTC AAG ATT AAG 592 

ACG CAA GAT GGC CAT GCA AGA CTA GGG AAT CTA GAG TTT CTC GAA GAG 2640 

AAA CCA TTA TTA GGG GAA GCA CTA GCT CGT GTG AAA AGA GCG GAG AAG 2688 . 
AAG TGG AGA GAC AAA CGA GAG AAA CTG CAG TTG GAA ACA AAT ATT GTT 
TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT GTA AAC TCT CAA 

TAT GAT AGA TTA CAA GTG GAT ACG AAC ATC GCA ATG ATT CAT GCG GCA 2832 
GAT AAA CGC GTT CAT AGA ATC CGG GAA GCG TAT CTG CCA GAG TTG TCT 
GTG ATT CCA GGT GTC AAT GCG GCC ATT TTC GAA GAA TTA GAG GGA CGT 

ATT TTT ACA GCG TAT TCC TTA TAT GAT GCG AGA AAT GTC ATT AAA AAT 976 

GGC GAT TTC AAT AAT GGC TTA TTA TGC TGG AAC GTG AAA GGT CAT GTA 024 

GAT GTA GAA GAG CAA AAC AAC CAC CGT TCG GTC CTT GTT ATC CCA GAA 3072 

TGG GAG GCA GAA GTG TCA CAA GAG GTT CGT GTC TGT CCA GGT CGT GGC 3120 

TAT ATC CTT CGT GTC ACA GCA TAT AAA GAG GGA TAT GGA GAG GGC TGC 3168 

GTA ACG ATC CAT GAG ATC GAA GAC AAT ACA GAC GAA CTG AAA TTC AGC 3216 

AAC TGT GTA GAA GAG GAA GTA TAT CCA AAC AAC ACA GTA ACG TGT AAT 3264 

AAT TAT ACT GGG ACT CAA GAA GAA TAT GAG GGT ACG TAC ACT TCT CGT 3312 

AAT CAA GGA TAT GAC GAA GCC TAT GGT AAT AAC CCT TCC GTA CCA GCT 360 

GAT TAC GCT TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA 3408 

GAG AAT CCT TGT GAA TCT AAC AGA GGC TAT GGG GAT TAC ACA CCA CTA 4 6 

CCG GCT GGT TAT GTA ACA AAG GAT TTA GAG TAC TTC CCA GAG ACC GAT 504 

AAG GTA TGG ATT GAG ATC GGA GAA ACA GAA GGA ACA TTC ATC GTG GAT 3552 
AGC GTG GAA TTA CTC CTT ATG GAG GAA 
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(1) GENERAL INFORMATION: . . 

(i) APPLICANT: Malvar, Thomas _ *" ' / =• ; v ; ;V?^ff : 

Gilmer, Amy Jelen. ' . r. < ■ ,■ -%-i':J X ' \- ■ 

Ciii TITLE OF INVENTION: HYBRID BACILLI 

Ul) WITH NOV^L BROAD SPECTRUM ^SECTJ CIDAL ACTIVITY ^ ; ^ ;., ■< y 
(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, White & Durkee 

(B) STREET: P.O. Box 4433 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: United States of America 

(F) ZIP: 77210 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Pa tent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unknown 

(B) FILING DATE: Concurrently Herewith 

(C) CLASSIFICATION: Unknown 

(viii) ATTORNEY /AG-ENT INFORMATION: 

(A) NAME: Kitchell, Barbara S. 

(B) REGISTRATION NUMBER: 3 3,92 8 

"(C) REFERENCE/DOCKET NUMBER: MOBT:009 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (512) 418-3000 

(B) TELEFAX: (512) 474-7577 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(!) SEQUENCE CHARACTERISTICS: _ 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
■(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

23 

GGATAGCACT CATCAAAGGT ACC 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAAGATATCC AATTCGAACA GTTT6CC. • ■ 

( 2 ) INFORMATION FOR SEQs. ID ,M 

(it- SEQUENCE CHARAQTERI?^S : : . ; 
(A) LENGTH : 28 base pairs 
/(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CATATTCTGC CTCGAGTGTT GCAGTAAC 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CCCGATCGGC CGCATGC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 
CATTGGAGCT CTCCATG 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
GCACTACGAT GTATCC 



(2) INFORMATION FOR SEQ ID NO : 7 : 
" (i) SEQUENCE CHARACTERISTICS: 

-114- 



A: 59587( 19270M.DOC) 



(A) LENGTH : N 20 base pairs • : < 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear . - , 
(xi) SEQUENCE. DESCRIPTION : SEQ ID NO: 7.: / ■ ' • ^ >r ^ y-: ^ 

CATCGTAGTG CAACTCTTAC . . ■ ■...;/. "■■ / '.■ : 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) -SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCAAGAAAAT ACTAGAGCTC TTGTTAAAAA AGGTGTTCC 

;v 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .3531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: . 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 
Asp isn Asn Pro Asn He Asn Glu Cys lie Pro Tyr Asn Cys Leu 
15 10 15 

AGT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg lie Glu ThT Gly 

'20 25 30 - - 

TAC ACC CCA ATC GAT ATT TCC TTG TCG CTA ACG CAA TTT CTT TTG AGT" 
Sr Pro lie Asp He Ser Leu Ser Leu Thr Gin Phe Leu Leu Ser 

35 40 45 

GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GTT GAT ATA ATA 
Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp lie He 
50 55 60 

TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 
Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie 
65 70 75 80 

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 
Si Gin Leu lie Asn Gin Arg lie Glu Glu Phe Ala Arg Asn Qln Ala 
85 90 95 
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ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT TAT CAA ATT TAC GCA GAA 
ill Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin lie Tyr Ala Glu 
100 105 

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA, AG* GAA 
Ser Phe Arg Glu Trp Glu Ala Asp .Pro Thr Asn Pro. Ala Leu, Arg Glu 
' 115 120 ,.. 1.25 ' . 



AGA ACT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 
50 irg Ser lie Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 
290 295 300 



336 



3184 



TAP ATG CGT ATT CAA TTC AAT GAG ATG AAC AGT GCC CTT ACA-ACC GCT, : &3f, 
10 Glu Met Arg ne Gin Phe Asn.Asp Met Ash Ser Ala Leu Thr W^la / , 

130 i35 



480 



528 



ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT CTT TTA TCA GTA 
III Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Val 
15 145 150 155 

TAT GTT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 
Tyr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser 
165 I 70 

20 GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 576 

Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr lie Asn Ser Arg 

25 TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT TAT GCT GTA 624 

25 Sr £s"n Jsp Leu Thr Arg Leu lie Gly Asn Tyr Thr, Asp Tyr Ala Val 
195 200 20b 

CGC TGG TAC AAT ACG GGA TTA GAA CGT GTA TGG GGA CCG GAT TCT AGA 
30 Arg Trp £r Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 

GAT TGG GTA AGG TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 
Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
35 225 230 235 

TTA GAT ATC GTT GCT CTG TTC CCG AAT TAT GAT AGT AGA AGA TAT CCA 
llu Asp lie Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 

40 ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 816 

ill Arg S Val Ser Gin Leu Thr Arg Glu lie Tyr Thr Asn Pro Val 

260 265 

■ j — - - 

45 TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 

45 III G^u A^n III Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 



672 



720 



768 



864 



912 



960 



1008 



ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 

ill Sr Tnr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly Hxs Gin 
55 305 310 315 

' ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA' TTC ACT TTT CCG 

lie Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 

325 330 . • 

60 CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT 1056 
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Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg. lie Val Al;a 
-340 345 350 . 

i 

CAA CTA GGT C AG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 
Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 

355 '360 :f- ; • ■ - ;:-^65- ; =/••; 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 

Arg Pro Phe Asn He Gly He Asn Asn Gin Gin Leu/Ser V ? l , ; Leu Asp- 

370 375 \;3|0;>- : .:. ■ ; V 

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 
385 390 395 . 400 

TAC AG\ AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 
Tvr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 410 415 

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 
Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
420 425 430 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATAi 
Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser lie He 
435 440 445 

AGA GCT CCA ATG TTT TCT f GG ACG CAC CGT AGT GCA ACC CCT ACA AAT 
Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala Thr Pro Thr Asn 
450 455 460 

ACA ATT GAT CCG GAG AGG ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 
Thr He Asp Pro Glu Arg He Thr Gin He Pro Leu Val Lys Ala His 
465 470 475 480 

ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG TTT ACG GGA 
Thr Leu Gin Ser Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr Gly 
485 490 495 

GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA TTT GCT TAT ACT ATT 
Gly Asp He Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr He 
500 505 510 _ 

GTT AAT ATA AAT GGG CAA TTA CCC CAA AGG TAT CGT GCA AGA ATA .CGC 
Val Asn He Asn Gly Gin Leu Pro Gin Arg Tyr Arg Ala Arg He Arg 
515 520 525 

TAT GCC TCT ACT ACA AAT CTA AGA ATT TAC GTA ACG GTT GCA GGT GAA 
Tyr Ala Ser Thr Thr Asn Leu Arg He Tyr Val Thr Val Ala Gly Glu 
530 535 540 

CGG ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG GAT ACC GGT GAC CCA 
Ara He Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 
545 550 555 560 

TTA ACA TTC CAA TCT TTT AGT TAC GCA ACT ATT AAT ACA GCT TTT ACA 
Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr He Asn' Thr Ala Phe Thr 
565 570 575 

TTC CCA ATG AGC CAG AGT AGT TTC ACA GTA GGT GCT GAT ACT TTT AGT 
Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 
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580 585 590 

TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 1824 ' 

Ser Gly Asn Glu Val Tyr He Asp Arg Phe Glu Leu He Pro Val Thr 
595 600 605 : ■ 

GCA ACA TTT GAA GCA GAA TAT GAT TTA GAA AGA GCA CAA AAG;GCG;gTG .1872. 
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg- Ala Gin Lys Ala Val • . v 

6io 615 ' 620 ■ ; -v 

AAT GCG CTG TTT ACT TCT ATA AAC CAA ATA GGG ATA -AAA ACA GAT GTG ., 1920 
Asn Ala Leu Phe Thr Ser He Asn Gin lie Gly He Lys Thr Asp Val 
625 630 635 640 

15 ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA TCA 1968 

Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser 
545 650 655 

GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA 2016 
20 Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys 

660 665 670 

CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC 
His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
25 675 680 685 

TTC AAA GGC ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG 
Phe Lys Gly He Asn Arg Gin Leu Asp Arg Gly Trp Arg Gly Ser Thr 
690 " 695 700 

-in 

GAT ATT ACC ATC CAA AGA GGA GAT GAC GTA TTC AAA GAA AAT TAT GTC 
Asp He Thr He Gin Arg Gly Asp Asp Val Phe Lys Glu Asn Tyr Val 
705 710 715 720 

35 ACA CTA CCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 

Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 
725 730 735 

AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 
40 Lys He Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 

740 745 750 

GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ATT CGC-TAC 
Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg. Tyr 
45 755 760 765- 

AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA -TGG 
S£n Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp 
770 775 780 



50 



55 



CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT CGA 
Pro Leu Ser Ala Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg 
785 790 795 800 



GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC.ATT 
60 Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp He 

820 825 . 830 - , 



2064 



2.112 



2160 



2208 



2256 



2304 



2352 



2400 



TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 2448 
Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg 
805 810 815 



2496 
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GAT GTA GGA TGT ACA GAC TTA AST GAG GAC CTA GGT GTA TGG GTG ATC 
Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val. Trp Val lie 
835 840 845 

TTT AAG ATT AAG ACG CAA GAT. GGG CAC GGA AGA CTA; GGG. AAT CTA GAG 
Phe Lys lie. Lys Thr Gin Asp Gly His Ala Arg Leu- Gly Asn Leu Glu 

850 sss . / 860 v> : . . ^ . 

TTT CTC GAA GAG AAA CCA TTA GTA GGA GAA GCG CTA GCT CGT GTG AAA 
Phe Leu Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg Val Lys 
865 870 875 880 

AGA GCG GAG AAA AAA TGG AGA GAC AAA CGT GAA AAA TTG GAA TGG GAA 
Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp, Glu 
885 890 895 

ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT 
Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe 
900 " 905 910 

GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GCC ATG 
Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn He Ala Met 
915 920 925 

ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GCT TAT CTG 
lie His Ala Ala Asp Lys Arg Val His Ser lie Arg Glu Ala Tyr Leu 
930 935 940 

CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT ATT TTT GAA GAA 
Pro Glu Leu Ser Val lie Pro Gly Val Asn Ala Ala He Phe Glu Glu 
945 950 955 960 

TTA GAA GGG CGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA AAT 
Leu Glu Gly Arg He Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 975 

GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG AAC GTG 
Val He Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 985 990 

AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 
Lys Gly His Val Asp Val Glu Glu Gin Asn Asn Gin Arg Ser VaT Leu 
995 ' " 1000 1005 . . - 

GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT CGT GTC TGT' 
Val Val Pro Glu Trp Glu Ala Glu Vc.l Ser Gin Glu Val Arg Val Cys 
1010 1015 1020 

CCG GGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT 
Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr 
1025 I 030 1035 

GGA GAA GGT TGC GTA' ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA 
Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp Glu 
1045 1050 1055 

CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 
Leu Lys Phe Ser Asn Cys Val -Glu Glu Glu He Tyr Pro Asn Asa Thr 
1060 - 1065 1070 
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GTA ACG TGT AAT GAT TAT ACT GTA AAT GAA;- GAA GAA; TAC .GGA GGT, GCG, 
Val Thr Cys Asn Asp Tyr Thr. Val Asn Gin Glu Glu' Tyr Gly Gly Ala, 
1075 1080 ' 1085 

TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CCA JgCT ! 
Sr tS sS Arg Asn ,Arg Gly Tyr Asn Glu Ala Proper Val Previa 
1090 • • " ' 1095 1100- - r 

GAT TAT GCG TCA GTC TAT GAA GAA AAA TCG TAT AC A GAT GGA CGA , A.GA 
Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser fl£ Thr Asp Gly Arg Arg.. 
1105 1110 

GAG AAT CCT TGT GAA TTT AAC AGA GGG TAT AGG GAT TAC ACG CCA CTA 
Gil £n Pro Cys Glu Phe Asn Arg Gly Tyr Arg Asp Tyr Thr Pro Leu 
1125 H30 H« 

CCA GTT GGT TAT GTG ACA AAA GAA TTA GAA TAC TTC CCA GAA ACC GAT 
Pro Val Sy Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro G lu Thr Asp 
1140 1145 llb 

AAG GTA TGG ATT GAG ATT GGA GAA ACG GAA GGA ACA TTT ATC GTG GAC 
Trp ] 
1155 



AAG GTA TGG ATT GAG ATT GGA GAA J~ '\ 7r T a ~ 

Ss Va^ Jrp He Glu lie Gly Glu Thr Glu Gly Thr Phe He Val Asp 
* 1160 llbb 



AGC GTG GAA TTA CTC CTT ATG GAG GAA 
Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 H75 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asp Asn Asn Pro Asn lie Asn Glu Cys He Pro Tyr Asn Cys Leu 
! 5 1° _ 

Ser Asn Pro G £u Val Glu Val Leu Gly Gly Glu Arg He Gl.u Thr Gly 
20 25 _ 

Tyr Thr Pro He Asp lie Ser Leu Ser Leu Thr Gin Phe Leu Leu <Ser 
3 5 40 4:3 

Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp lie lie 
50 55 60 

Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie 
65 70 75 

Glu Gin Leu lie Asn Gin Arg lie Glu Glu Phe Ala Arg Asn Gin Ala 
85 90 

lie Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin lie Tyr Al&_Glu 

105 i - LU 



100 



A: 595R7(19Z70I!.DOC) 



-120- 



C - c 



Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu ^ 

" - ■ ' 115 12d 125 : , 

Glu Met Arg lie Gin Phe Asn Asp Met Asn Ser 'Ala Leu Thr Thr, Ala 

no ■ •■ . . . 135 ■ 140 ■ n^^ : r, , ; - 



Ile Pro • Leu Phe, Ala Val Gin Asn Tyr Gln V Val Pro Leu : lieU iSer Vk.V 
145 150. • 155 • \y ; ; 160 

Tyr Val Gin Ala Ala Asn Leu His Leu ,Ser ; Val Leu Arg Asp Val /Ser 
165 170 • " *175 ' 

Val Phe Gly Gin Arg Tr-p Gly Phe Asp Ala Ala Thr He Asn Ser Arg 
180 185 190 

Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn Tyr Thr Asp Tyr Ala Val 
195 . 200 205 

Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 235 240 

Leu Asp He Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 
245 250 255 

He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
260 265 270 

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 

Arg Ser He Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 
290 - 2-9T5 300 

He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 
305 310 315 320 

He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 330 335 

Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg He Valt-Ala 
340 345 350 . . 

Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 
355 360 365 

Arg Pro Phe Asn He Gly He Asn Asn Gin Gin Leu Ser Val Leu Asp 
370 375 380 

Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 

385 390 ... 395 400 

Tvr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 ••- 410 . 415 

Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
420 425 430 

Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser lie He 
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435 440 ,445 ■. 

Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala Thr Pro Thr Asn 
450 455 . 460 

Thr lie Asp Pro Glu Arg' lie Thr Gin He Pro Leu Val Lys Ala His 

465 470 475- >■ ■ ■ * ;*W 

■■.'••*>.' 

Thr Leu Gin Ser Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr Gly 

485 ; '490. . 49.5 ? .■■ ■ 

* ' • ■ . :■ " • •*:•<<, ; * 

Gly Asp He Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr He 
500 ~ 505 510 

Val Asn He Asn Gly Gin Leu Pro Gin Arg Tyr Arg Ala Arg lie Arg 
515 520 525 

Tyr Ala Ser Thr Thr Asn Leu Arg He Tyr Val Thr Val Ala Gly Glu 
530 535 540 

Arg He Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 
545 550 555 560 

Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr He Asn Thr Ala Phe Thr 
565 570 575 

Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 
580 585 590 

Ser Gly Asn Glu Val Tyr He Asp Arg Phe Glu Leu He Pro Val Thr 
595 600 605 

Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala Val 
610 615 620 

Asn Ala Leu Phe Thr Ser He Asn Gin He Gly He Lys Thr Asp Val 
625 630 635 640 

Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser 
645 650 655 

Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys 
660 665 670 — 

His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
675 680 685 

Phe Lys Gly He Asn Arg Gin Leu Asp Arg Gly Trp Arg Gly Ser Thr 
690 " 695 700 

Asp He Thr He Gin Arg Gly Asp Asp Val Phe Lys Glu Asn Tyr Val 
705 710 715 720 

Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 
725 730 735 

Lys He Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg 'Tyr Gin Leu Arg 
740 745 750 

Gly Tyr He Glu Asp Ser Gin Asp Leu Glu lie Tyr Leu He Arg Tyr 
755 760 765 
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Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr ;.Gly Ser LeuTrp* \ 
770 775 . 780 

5 Pro Leu Ser Ala Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn" Arg, 
785 790 795 :. ., . 80 °. 

Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg 

805 810 : /-fits-. .,y>.:- 

in < - ■"■ . • ., ' 

Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp He 

820 825 830 

Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val lie 
15 835 840, 845 

Phe Lys He Lys Thr Gin Asp Gly His Ala Arg Leu Gly Asn Leu Glu 
850 855 860 

20 Phe Leu Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg Val Lys 
865 870., 875 880 

Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp Glu 
885 890 895 

Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala^ Leu Phe 
900 905 910 



25 



Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn lie Ala Met 
30 915 920 925 

He His Ala Ala Asp Lys Arg Val His Ser He Arg Glu Ala Tyr Leu 
930 - 935 940 

35 Pro Glu Leu Ser Val Ile'Pro Gly Val Asn Ala Ala He Phe Glu Glu 

945 950 955 960 

Leu Glu Gly Arg He Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 975 

40 

Val He Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 985 990 ^ 

Lys Gly His Val Asp Val Glu Glu Gin Asn Asn Gin Arg Ser Val Leu 
45 995 1000 i00-5 

Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu Val Arg Val Cys 
1010 1015 I 020 

50 Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr 

1025 1030 1035 1040 

Gly Glu Gly Cys Val Thr He His Glu lie Glu Asn Asn Thr Asp Glu 
1045 1050 1055 

Leu Lys Phe Ser Asn Cys Val Glu Glu Glu He Tyr Pro Asn Asn Thr 
1060 " 1065 1070 

Val Thr Cys Asn Asp Tyr Thr Val Asn Gin Glu Glu Tyr Gly Gly. Ala 
60 1075 ~ 1080 1085 



55 
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10 



15 



Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser Val Pro Ala 
1090 " 1095 HOC 

Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly -Arg Arg 
1105 1.110 HIS ■« . . 1120 

Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr Arg' Asp: Tyr Thr Pro Leu 
1125 1130 1135 .,. •; 

t ■ . " ' '■■ ' 
Pro Val Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Prb Glu Thr Asp , 
1140 ' 1145 ' ; * H'50 f ! ■ 

Lys Val Trp He Glu He Gly Glu Thr Glu Gly Thr Phe He Val Asp 
1155 H60 H65 

Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 H75 



20 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3531 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
30 (B) LOCATION: 1..3531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 
35 Met Asp Asn Asn Pro Asn lie Asn Glu Cys He Pro Tyr Asn Cys Leu 
15 10 ' I 5 

AGT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 
40 ' 20 25 30 

TAC ACC CCA ATC GAT ATT TCC TTG TCG CTA ACG CAA TTT CTT TTG AGT 
Tyr Thr Pro He Asp He Ser Leu Ser Leu Thr Gin Phe Leu Le^-Ser 
35 * ' 40 45 _ . 

45 

GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GTT GAT ATA ATA - 
Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp He -He 
50 55 60 

50 TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 

Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie 
65 70 75 80 

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 
55 Glu Gin Leu He Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala - 

85 90 _ 95 

ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT TAT CAA ATT TAC GCA GAA 
He Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin He Tyr AlaJSlu 
60 100 105 no 



48 



96 



144 



192 



240 



288" 



336 
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45 



60 



CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 

-125- 



f 



TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AGA GAA 384 
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 
U5 120 125 

5 GAG ATG CGT ATT CAA TTC AAT GAC ATG AAC AGT GCC CTT ACA ACC GOT 432 '.. 

Giu Met Arg lie Gin Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala 4 , 

130 " 135 •■' 14"0 ■ 

ATT CCT CTT TTT GCA' GTT CAA AAT TAT /CAA GTT CGT' CTT TTA TCA GTA ■ 480 . : 
10 He Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro. Leu Leu Ser Val. ; . 

145 150 155 • 160 

TAT G TT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 528 
Tvr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser 
15 165 170 ' 175 

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 576 

Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr He Asn Ser Arg 
180 " 165 19° 

20 

TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT TAT GOT GTA 
Tyr Asn Asp Leu Thr Arg Leu He Gly Asn Tyr Thr Asp Tyr Ala Val 
195 200 205 

25 CGC TGG TAC AAT ACG GGA TTA GAA CGT GTA TGG GGA CCG GAT TCT AGA 

Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

GAT TGG GTA AGG TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 
30 Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 

225 230 235 24U 

TTA GAT ATC GTT GCT CTG TTC CCG AAT TAT GAT AGT AGA AGA TAT CCA 
Leu Asp He Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 
35 245 250 255 

ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 
He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
260 265 270 



624 



672 



720 



768 



816 



864 



TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 
Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly lie Glu 
275 280 285 — 

AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 912 
Arg Ser He Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr- 
290 295 300 

ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 960 
50 He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 

305 310 315 320 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 1008 
He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
55 ~ 325 330 335 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA' CGT ATT GTT GCT 1056 
Leu Tvr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg He Val Ala 
340 345 350 
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Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 
355 360 365 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 
Arg Pro Phe Asn lie Gly He Asn Asn Gin Gin, Leu .Ser Val Leu Asp 

370 375 380 , : - . . ■' ;^ ,. 

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 
Glv Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leji Pro Ser Ala Val 
385 390 395 • 

TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 
Tvr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 410 . 415 

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 
Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
420 425 430 

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATA 
Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser He lie 
435 440 445 

AGA GCT CCA ATG TTT TCT TGG ACG CAC CGT AGT GCA ACC CCT ACA AAT 
Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala Thr Pro Thr Asn 
450 455 460 

ACA ATT GAT CCG GAG AGG ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 
Thr He Asp Pro Glu Arg He Thr Gin He Pro Leu Val Lys Ala His 
465 470 475 480 

ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG TTT ACG GGA 
Thr Leu Gin Ser Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr Gly 
485 490 495 

GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA TTT GCT TAT ACT ATT 
Gly Asp He Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr He 
500 505 510 

GTT AAT ATA AAT GGG CAA TTA CCC CAA AGG TAT CGT GCA AGA ATA CGC 
Val Asn He Asn Gly Gin Leu Pro Gin Arg Tyr Arg Ala Arg He Arg 
515 520 525 

TAT GCC TCT ACT ACA AAT CTA AGA ATT TAC GTA ACG GTT GCA GGT. GAA 
Tyr Ala Ser Thr Thr Asn Leu Arg He Tyr Val Thr Val. Ala Gly Glu 
530 535 540 

CGG ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG GAT ACC GGT GAC CCA 
Arg He Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 
545 550 555 560 

TTA ACA TTC CAA TCT TTT AGT TAC GCA ACT ATT AAT ACA GCT TTT ACA 
Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr He Asn Thr Ala Phe Thr 
565 570 575 

TTC CCA ATG AGC CAG AGT AGT TTC ACA GTA GGT GCT GAT ACT TTT AGT 
Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 
580 585 590 

TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 
Ser Gly Asn Glu Val Tyr He Asp Arg Phe Glu Leu He Pro Val Thr 
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595 



600 605 



GCA ACA CTC GAG GCT GAA TAT AAT CTG GAA AGA GCG CAG AAG GCG GTG 
Ala Thr Leu Glu Ala Glu Tyr Asn Leu Glu Arg Ala Gin Lys Ala Val 

615 620 ;. ;? ; 



610 



AAT GCG GTG TTT ACG TCT ACA AAC CAA CTA" GGG CTA AAA ACA AAT ,Gf A 
Asn Ala Leu Phe Thr Ser Thr Asn Gin Leu Gly Leu Lys .Thr Asn Val 
625 630 635 640 



ACG GAT TAT CAT ATT GAT CAA GTG TCC AAT TTA GTT ACG TAT TTA TCG 
Thr Asp Tyr His lie Asp Gin Val Ser Asn Leu Val Thr Tyr Leu Ser 
" ^ 650 655 



645 



GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG TjCC GAG 1 AAA GTC AAA 
Asp Glu Phe Cys Leu Asp Glu Lys Arg- Glu Leu Ser Glu Lys Val Lys 

665 670 



660 



CAT GCG AAG CGA CTC AGT GAT GAA CGC AAT TTA CTC CAA GAT TCA AAT. 
His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Ser Asn 
675 ~ 680 685 

TTC AAA GAC ATT AAT AGG CAA CCA GAA CGT GGG TGG GGC GGA AGT ACA 
Phe Lys Asp lie Asn Arg Gin Pro Glu Arg Gly Trp Gly Gly Ser Thr 
690 ' 695 700 

GGG ATT ACC ATC CAA GGA GGG GAT GAC GTA TTT AAA GAA AAT TAC GTC 
Gly He Thr He Gin Gly Gly Asp Asp Val Phe Lys Glu Ash Tyr Val 
705 710 715 720 

ACA CTA TCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 
Thr Leu Ser Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 
725 - 730 735 

AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 
Lys He Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 
740 • 745 750 

GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ATT CGC TAC 
Glv Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg Tyr 
755 760 765 

AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA" TGG 
Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu. Trp 
770 775 780 

CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT CGA 
Pro Leu Ser Ala Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg 
785 790 795 800 

TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 
Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg 
805 810 815 

GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC ATT 
Asp Gly Glu Lys Cys Ala His His Ser His His PheSer Leu Asp He 
820 825 830 

GAT GTA GGA TGT ACA GAC TTA AAT GAG GAC CTA GGT GTA TGG GIG ATC 
Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val He 
835 840 845 
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TTT AAG ATT AAG ACG CAA GAT GGG CAC GCA AGA CTA GGG AAT CTA GAG 
Phe £ys lie Lys Thr Gin Asp Qly His Ala Arg Leu Gly Asn. Leu Glu 

850 ess , . 86 °: .'. . . 

TTT CTG GAA GAG AAA CCA TTA GTA - GGA GAA GCG CTA GCT. CGT ; GTG AAA 
Phi lIu Su~ Glu Lys Pro Leu Val Gly Glu 'Ala . Leu Ala A^r Val Lys 

865 870 875 ; , . a ?V : 

arA rrr GAG AAA AAA TGG AGA GAC AAA CGTGAA~AAA TTG GAA . TGG GAA 
Arg Ala Glu Ly£ Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trpdu 
- 885 890 895 

ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT QCT TTA TTT 
T~nr A^n III Sal Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe 
900 905 910 

GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GCC ATG 
Sat A^n Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn He Ala Met 
915 920 9 25 

ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GCT TAT CTG 
Se Ss Ma Ala Asp Lys Arg Val His Ser lie Arg Glu Ala Tyr Leu 
930 "5 940 

CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT ATT TTT GAA GAA 
Pto SS Su Sir Val lie Pro Gly Val Asn Ala Ala lie Phe Glu Glu 
945 950 955 3t>u 

TTA GAA GGG CGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA AAT 
llu SS Gly Arg lie Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 

GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG AAC GTG 
?al lie Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 985 990 

AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 
Sly As Val Asp Val Glu Glu Gin Asn Asn Gin Arg Ser Val Leu 
* ' 1000 1005 



995 



GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT CGT GTC TGT 
Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu Val Arg VaTCys 

ioio * io 15 1020 ■ 

rrr CGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT " 
Pro Sy Arg Gly Sr lie Leu Arg Val Thr Ala Tyr Lys Glu Gly .Jjr 
1025 1030 

GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA 
gS Su" Gly Cys Val Thr lie His Glu lie Glu Asn Asn Thr Asp Glu 
10 45 1°50 

CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 
Su Lys Phe Ser Asn Cys Val Glu Glu Glu lie Tyr Pro Asn Asn Thr 
1060 1055 . 1070 

GTA ACG TGT AAT GAT TAT ACT GTA AAT CAA GAA GAA TAC GGA GGT GCG 
Sal Thr Cys Asn Asp Tyr Thr Val Asn Gin Glu Glu Tyr Gly Gly-Ala 

1080 1085 



107 5 
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TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT COT TCC GTA CCA GCT 
Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser Val Pro Ala. 
1090 1095 1100 

GAT TAT GCG TCA GTC TAT GAA GAA AAA TCG TAT AC A ^AT GGA CGA AGA 
Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg. Arg 
1105 1110 1115 1120 



GAG AAT CCT TGT GAA TTT AAC AGA GGG TAT AGG GAT TAC ACG CCA CTA.: 
q-^^ y" -» - -- * r*i, r nvr-v- a >-/-t ac?^ T\rr Thr Fro Leu 

1125 



GAG AAT CCT TGT GAA TTT ^ ™* ««« — . . . f 

Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr Arg Asp Tyr Thr Pro Leu 

1130 1135 



CCA GTT GGT TAT GTG ACA AAA GAA TTA GAA TAC TTC CCA GAA ACC GAT 
Pro Val Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp 

. .; H40 H45 use - 

AAG GTA TGG ATT GAG ATT GGA GAA ACG GAA GGA ACA TTT ATC GTG GAC 
£ys SS £5 lie Glu lie Gly Glu Thr Glu Gly Thr Phe lie Val Asp 

11S5 use n 65 

AGC GTG GAA TTA CTC CTT ATG GAG GAA 
Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 H75 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asp Asn Asn Pro Asn lie Asn Glu Cys lie Pro Tyr Asn Cys Leu 
15 10 15 

Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 
20 25 30 

Tvr Thr Pro He Asp He Ser Leu Ser Leu Thr Gin Phe Leu LelTSer 
35 40 45 . . 

Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp He He 
50 55 60 

Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie 
. 65 70 75 80 

Glu Gin Leu He Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala 
85 • 90 95 

He Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin He Tyr Ala Glu 
100 105 11° 

Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 
115 120 125 

Glu Met Arg He Gin Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala 
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c 



130 135 



140 



He Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Val 
145 150 155 160 

5 . ~ :' 

Tyr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg. Asp Val Ser 
165 • 170 175 

Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr lie Asn Ser Arg 

10 180 185 . • /.I. 

Tyr Asn Asp Leu Thr Arg Leu He Gly Asn Tyr Thr Asp Tyr Ala Val 
195 200 205 

15 Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
-210 215 220 



20 



Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 235 240 

Leu Asp He Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 
245 250 255 

He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
25 260 265 270 

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly lie Glu 
275 280 285 

30 Arg Ser He Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 

290 295 300 



35 



50 



He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 
305 .310 315 320 

He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 330 335 



Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg He Val Ala 
40 340 ' 345 350 

Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr_Arg 
355 360 365 

45 Arg Pro Phe Asn He Gly He Asn Asn Gin Gin Leu Ser Val Leu Asp 
370 375 380 

Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 
385 390 395 400 



Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 410 415 



Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
55 420 425 430 

Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser' Val Ser He He 
435 440 445 

60 Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala Thr Pro Thr Asn 
450 455 460 
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10 



25 



40 



C C 



Thr He Asp Pro Glu Arg lie Thr Gin lie Pro Leu Val Lys Ala His 

465 ' 470 475 480 

Thr Leu Gin Ser Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr Gly 

485 ■ 490' ; ; 495 

Gly Asp lie Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr ile 

500 " 505 , ' , ; - 510^; 

Val Asn Ile Asn Gly Gin Leu Pro Gin' Arg- Tyr Arg Ala Arg lie Arg 

515 520 525 

Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu 

15 530 . 535 540 

Arg Ile Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 

545 550 555 560 

20 Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr 

565 570 575 



Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 
580 585 590 

Ser Gly Asn Glu Val Tyr Ile Asp Arg Phe Glu Leu lie Pro Val Thr 
595 600 605 

Ala Thr Leu Glu Ala Glu Tyr Asn Leu Glu Arg Ala Gin Lys Ala Val 
30 610 615 620 

Asn Ala Leu Phe Thr Ser Thr Asn Gin Leu Gly Leu Lys Thr Asn Val 
62 5 63 0 63 5 640 

35 Thr Asp Tyr His Ile Asp Gin Val Ser Asn Leu Val Thr Tyr Leu Ser 

645 650 655 



Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys 
660 665 670 

His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Ser Asn 
675 680 685 



Phe Lys Asp He Asn Arg Gin Pro Glu Arg Gly Trp Gly Gly Ser Thr 
45 690 ' 695 700 

Gly Ile Thr Ile Gin Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val 
705 710 715 720 

50 Thr Leu Ser Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 

725 730 735 

Lvs Ile Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 
740 " 745 750 

Glv Tvr Ile Glu Asp Ser Gin Asp Leu Glu Ile Tyr Leu Ile Arg Tyr 
755 760 765 

Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu_Trp 
60 770 775 780 
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Pro Leu Ser Ala Gin Ser Pro lie Sly Lys Cys Gly Glu Pro! .Asn Arg 
785- ' 790 • /' ■ -7.95 ; ;.; ; , ?a ° 

Cys Ala Pro His Leu Glu Trp Asn- Pro Asp Leu: Asp Cys v Ser Cys Arg 

805 , . 810 ■ • . . I'://- 

Asp GlySGlu Lys Cys Alt His His Ser '^m^Kf^M-^^y' 
'. 820 . 825 , ; . .830 

Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly V41. Trp V^'l II*; • 

' 835,' : - . ■ A ; - )f>S 

Phe Lys lie Lys Thr Gin Asp Gly His Ala Arg Leu Gly Asn Leu Glu 
850 855 860 

Phe Leu 'Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala" Arg Val Lys 
865 870 • • 875 880 

Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp Glu 
885 890 89b 

Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe 
900 905 91° 

Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn He Ala Met 
915 920 9^5 

He His Ala Ala Asp Lys Arg Val His Ser lie Arg Glu Ala Tyr Leu 
930 93 5 940 

Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala He Phe Glu Glu 
945 950 955 960 

Leu Glu Gly Arg He Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 975 

Val He Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 985 990 

Lvs Gly His Val Asp Val Glu Glu Gin Asn Asn Gin Arg Ser Val Leu 
995 1000 1005 

Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu Val Arg Vai-Cys 
1010 1015 1020 

Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr- 
1025 1030 1035 1040 

Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp Glu 
10 45 1050 1055 

Leu Lys Phe Ser Asn Cys Val Glu Glu Glu He Tyr Pro Asn^Asn Thr 



1060 



1065 1070 



Val Thr Cys Asn Asp Tyr Thr Val Asn Gin Glu Glu Tyr Gly Gly Ala 
1075 1080 1085 

Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser Val Pro Ala 
1090 1095 HOO . 

Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg 
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1105 1110 1115 * . ; ^ 

Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr Arg Asp Tyr Thr Pfeieu 
1125 113;0 1135 

Pro Val Gly Tyr Val Thr Ly» <»ft W W^^JS^^" 1 

1140 • ' - ■ ' > ; i*45;;t, : . •„ ..;;/ > #^;^r;^: - 

Lys Val Trp He Glu He Gly Glu Thr :Glu.Gly Thr" Phe lie Val Asp 

\ ,ii55 • - .1160 ■ J: >.. m|; ;"• 

Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 H75 

(2) INFORMATION FOR SEQ ID NO:13 : 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 3531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1. .3531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 
Met Asp Asn Asn Pro Asn He Asn Glu Cys He Pro Tyr Asn Cys Leu 
15 10 15 

AGT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 

20 25 3*0 

TAC ACC CCA ATC GAT ATT TCC TTG TCG CTA ACG CAA TTT CTT TTG AGT 
Tyr Thr Pro He Asp He Ser Leu Ser Leu Thr Gin Phe Leu Leu Ser 
35 40 45 

GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GTT GAT ATA ATA 
Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp ITS He 

50 , 55 6.0 . . .- 

TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 
Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin He 
65 70 75 80 

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 
Glu Gin Leu He Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala 
85 90 95 

ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT TAT CAA ATT TAC GCA GAA 
He Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin He Tyr Ala Glu 
100 105 HO 

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AGA GAA 
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Axs Glu 
115 120 125 
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GAG ATG CGT ATT CAA TTC AAT GAC ATG AAC &GT GCC CTT ACA. ACC GCT 
Glu Met Arg lie Gin Phe Asn Asp Met Asn Ser ..ffl4i/l*e^^r;-13ir Ala 
130 » 135 • 140} V" 

ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT CTT TTA TC A GTA 
lie; Pro Leu Phe Ala Val Gin Asn Tyr Gin Vai ; Pro Leu, hen, Ser Val 

145 - : 150 /, 15S- . - 

TAT GTT CAA GCT GCA AAT TTA CAT TTA TGA GTT TTG AGA GAT • GTT TCA 
Tyr Val Gin Ala Ala Asn Leu His Leu Ser - Val Leu &g-Asp Val Ser 

: 165 170 -.v^-y:.. -y 

' , • •. - • :■ ' - " - ' y. t 'T - - 

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 
Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr He „ Asn Ser Arg 
180 185 190 

TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT CAT GCT GTA 
Tyr Asn Asp Leu Thr Arg Leu He Gly Asn Tyr Thr Asp His Ala Val 
195 200 205 

CGC TGG TAC AAT ACG GGA TTA GAG CGT GTA TGG GGA CCG GAT TCT AGA 
Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

GAT TGG ATA AGA TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 
Asp Trp He Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
22 5 23 0 235 2 40 

TTA GAT ATC GTT TCT CTA TTT CCG AAC TAT GAT AGT AGA ACG TAT CCA 
Leu Asp He Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro 
245 250 255 

ATT CGA ACA GTT TCC - CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 
He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
260^ 265 270 

TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 
Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 

GGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 
Gly Ser He Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 
290 295 300 

ATC TAT ACG GAT GCT CAT AGA GGA GAA TAT TAT TGG TGA GGG CAT CAA 
He Tyr Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp Ser Gly His Gin 
305 310 . 315 ■ 320 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 
He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 330 335 ( 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT 
Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg He Val Ala 
3 40 3 45 3 50 

CAA CTA GGT CAG GGC GTG TAT AGA, ACA TTA TCG TCC ACT TTA TAT AGA 
Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 
355 360 365 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 
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Arg Pro Phe Asn lie Gly lie Asn Ash Gin Gin Leu Ser Vat Leu" Asp 
370 375 . . . . 3 80 . . ' . 

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TGC GCT G 'T A 
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Ash Leu Pro Sef Ala ^Vaf 
385 ' / 390. . : . 'v .,: "'3^5: /. a: ; i ; ^ ; 

TAG AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 
Tyr htg Lys Ser Gly Thf: Val Asp Ser Leu Asp Glu lie Pro' Pro^Glrr 

405 ' 410 - - :> 41J5, 7 - 

AAT AAt AAG GTG CCA CCT AGG MA M3GA^^ 

Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
42 0 425 43 0 

GTT TCA ATG TTT CGT. TCA GGC TTT AGT AAT AGT AGT GTA t AGT , ATA ATA 
Val Ser Met Phe Arg Ser Gly Phe Ser Ash " Ser Ser Val Ser lie lie 
435 440 445 

AGA GCT CCA ATG TTT TCT TGG ACG CAC CGT AGT GCA ACC CCT ACA AAT 
Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala Thr Pro Thr Asn 
450 455 460 

ACA ATT GAT CCG GAG AGG ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 
Thr He Asp Pro Glu Arg He Thr Gin He Pro Leu Val Lys Ala His 
465 470 475 480 

ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG TTT ACG GGA 
Thr Leu Gin Ser Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr Gly 
485 490 495 

GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA TTT GCT TAT ACT ATT 
Gly Asp He Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr He 
500 505 510 

GTT AAT ATA AAT GGG CAA TTA CCC CAA AGG TAT CGT GCA AGA ATA CGC 
Val Asn He Asn Gly Gin Leu Pro Gin Arg Tyr Arg Ala Arg He Arg 
515 '520 525 

TAT GCC TCT ACT ACA AAT CTA AGA ATT TAC GTA ACG GTT GCA GGT GAA 
Tyr Ala Ser Thr Thr Asn Leu Arg He Tyr Val Thr Val Ala Gly Glu 
530 535 540 

CGG ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG GAT ACC GGT GAC . CCA 
Arg He Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr- Gly Asp Pro 
545 550 555 560 

TTA ACA TTC CAA TCT TTT AGT TAC GCA ACT ATT AAT ACA GCT TTT ACA 
Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr He Asn Thr Ala Phe Thr 
565 570 575 

TTC CCA ATG AGC CAG AGT AGT TTC ACA GTA GGT GCT GAT ACT TTT AGT 
Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 
580 585 590 

TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 
Ser Gly Asn Glu Val Tyr He Asp Arg Phe Glu Leu He Pro Val Thr 
595 600 605 

GCA ACA TTT GAA GCA GAA TAT GAT TTA GAA AGA GCA CAA AAG GCG GTG 
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala Val 
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610 615 '6;20 

AAT GCG CTG TTT ACT TCT ATA AAC CAA ATA GGG ATA AAA ACA GAT GTG 

Asn Ala Leu Phe Thr Ser He Asn. Gin lie Gly lie Lys Thr Asp Val 

625 63 0 635 . .... -. 640 



ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA^GTG GAT TGT TTA vTCA,; 
Thr Asp Tyr His lie Asp Gin Val Ser Asn Leii Val Asp Cys Leu-Ser 

645 ■ 650. ■ '"■-■'■W: .,0s:. 

GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA 
Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys' 
660 " 665 670 

CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT. CCA AAC 
His Ala Lys Arg Leu Ser Asp Glu < Arg Asn Leu Leu Gin Asp Pro Asn 
675 680 685 

TTC AAA GGC ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG 
Phe Lys Gly He Asn Arg Gin Leu Asp Arg Gly Trp Arg Gly Ser Thr 
690 695 700 

GAT ATT ACC ATC CAA AGA GGA GAT GAC GTA TTC AAA GAA AAT TAT GTC 
Asp He Thr He Gin Arg Gly Asp Asp Val Phe Lys Glu Asn Tyr Val 
705 710 715 720 

ACA CTA CCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 
Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 
725 730 735 

AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 
Lys He Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 
740 745 750 

GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT Tin ATT CGC TAC 
Glv Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg Tyr 
755 760 765 

AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA TGG 
Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp 
770 775 780 

CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT" CGA 
Pro Leu Ser Ala Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn. Arg 
785 790 795 800^ 

TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 
Cvs Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg 
805 810 815 

GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC ATT 
Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp He 
820 825 830 

GAT GTA GGA TGT ACA GAC TTA AAT GAG GAC CTA GGT GTA TGG GTG ATC 
Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val He 
835 840 ' 845 

TTT AAG ATT AAG ACG CAA GAT GGG CAC GCA AGA CTA GGG AAT CTA, GAG 
Phe Lys He Lys Thr Gin Asp Gly His Ala Arg Leu Gly Asn Leu Glu 
850 855 860 
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TTT CTC GAA GAG AAA CCA TTA. GTA GGA GAA GCG CTA- GCT CGT- GTG , AAA' 
Phe Leu Glu Glu Lys Pro Leu Val Gly Glu Ala Leu *la Arg Val Lys 

865 870 §75. f ; : 880 ; 

AGA GCG GAG AAA AAA TGG AGA GAC AAA CGT GAA -AAA TTG ••GAA TGG ^GAA .. 
Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu- Lys;- Leu Glu Trp Glu - 
885' 8.90 .■ ' ; _ "8?5 ' 

ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT . GTA GAT. GCT TTA TTT 
Thr Asn He Val Tyr Lys Glu Ala Lys Glu.Ser Val Asp.. Ala- Leu; Phe,, 
goo ' " 905 '910' 

GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GCC ATG 
Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn lie Ala Met 
915 • 920 ' 925 

ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GCT TAT CTG 
He His Ala Ala Asp Lys Arg Val His Ser tie Arg Glu Ala Tyr Leu 
930 935 940 

CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT. ATT TTT GAA GAA 
Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala lie Phe Glu Glu 
945 950 955 960 

TTA GAA GGG CGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA AAT 
Leu Glu Gly Arg He Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 975 

GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG AAC GTG 
Val He Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 985 990 

AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 
Lys Gly His Val Asp Val Glu Glu Gin Asn Asn Gin Arg Ser Val Leu 
995 1000 1005 

GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT CGT GTC TGT 
Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu Val Arg Val Cys 
1010 1015 1020 

CCG GGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT 
Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly" Tyr 
1025 1030 1035 . . .1040 

GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA" 
Glv Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp Glu 
1045 1050 1055 

CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 
Leu Lys Phe Ser Asn Cys Val Glu Glu Glu He Tyr Pro Asn Asn Thr 
1060 1065 1070 

GTA ACG TGT AAT GAT TAT ACT GTA AAT CAA GAA GAA TAC GGA GGT GCG 
Val Thr Cys Asn Asp Tyr Thr Val Asn Gin Glu Glu Tyr Gly Gly Ala 
1075 1080 1085 

TAC ACT TCT CGT AAT CGA GGA TAT AAC GAA GCT CCT TCC GTA CCA GCT 
Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser Val Pxa Ala 
1090 1095 1100 
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20 



30 



35 



45 



50 



GAT TAT GCG TCA GTC TAT gAA =AA AAA go TAT AcX ™ 

Asp Tyr Ala Ser Val Tyr Glu Glu -.Lys Ser i^-,- * : ,*y * il 2 0 
1105 ' 1110 ' ' .. 




r GGT TAT GTG ACA AAA GAA .TTA gAA |aC ;TTC |f |f 
I Giy tyr -Val Thr Lys ,Gli ;^lu ^yr ; . Phe.-P-ro Glu T^r Asp, 



CCA GTT 
10 Pro Val 

15 1155 1160 llb 

* AGC GTG GAA TTA CTC CTT ATG GAG GAA 
Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 ll 75 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asp Asn Asn Pro Asn lie Asn Glu Cys He Pro Tyr Asn Cys Leu 

1 5 

Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg lie Glu Thr Gly 

20 25 
Tyr Thr Pro lie Asp He Ser Leu Ser Leu Thr Gin Phe Leu Leu Ser 
40 35 40 

Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp 13*11. 



50 



55 



Trp Gly lie Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu- Val Gin lie 



65 70 



Glu Gin Leu lie Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala 
85 90 

lie Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin lie Tyr Ala Glu 

100 105 
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 

55 H5 120 125 

Glu Met Arg lie Gin Phe Asn Asp Met Asn Ser Ala ' Leu Thr Thr Ala 
130 135 I 40 ^ 

»n x7=i rin Ac:n Tvr Gin Val Pro Leu Leu Ser Val 
60 He Pro Leu Phe Ala Val Gin Asn ryr 

145 KO 155 



3 360 



,3408 



3*56; 



3504 



3531 
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Tyr Val Gin Ala' Ala Asn Leu His Leu Ser Val Leu Arg Asp ,Val- Ser r, 
165 170 

Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr lie Asn^er Arg^ 

■ 180 185 ■ . ,. ■ <:.::• ; , H, vv- 

Tyr Asn Asp Leu Thr Arg Leu lie Gly. Asn Tyr Thr Asp His Ala Val 

195 200 '"■ V v ;v ^ v 

Arg. Trp Tyr Asn Thr .Gly Leu Glu^rg Val Trp Gly Pro Asp ::Ser:;Arg ; 
210 215 

Asp Trp lie Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 235 240 

Leu Asp He Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro 
245 250 Z33 

lie Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 

Gly Ser lie Arg Ser Pro His Leu Met Asp He Leu Asn Ser lie Thr 
290 295 300 

He Tyr Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp Ser Gly His Gin 
305 310 315 

lie Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 • 330 - 5J: > 

Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg lie Val Ala 



340 



Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 
355 360 365 

Arg Pro Phe Asn lie Gly He Asn Asn Gin Gin Leu Ser Val Leu Asp 
370 375 380 _ 

Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala. .Val 



385 



390 



Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 410 

Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
420 425 430 

Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser He He 



435 



440 



Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala . Thr Pro Thr Asn 



450 



455 



Thr lie Asp Pro Glu Arg He Thr Gin He Pro Leu Val Lys Ala. His 
465 470 4/b 
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Thr Leu Gin Ser Gly Thr Thr Val Vai Arg Gly Pro Gly Phe Thr Gly 

485 490 ' . '•■ 

Gly Asp He Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr He 

500 505 ,.' . ' :s }.-■■ - . •, . - 

Val Asn lie Asn Gly Gin Leu Pro Gin Arg; Tyr Arg^iprg. IVe Arg : 

Tyr Ala Ser Thr Thr Ash. Leu Arg-. lie . Tyr . Vai ; .f^^M^'.^S: 
530 ' ' ; ' " 5 ' ' - 54a : X ■ " 

Arg lie Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 
545 550 5 " 

Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr lie As*- Thr Ala Phe Thr 
565 570 

Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 

580 585 
Ser Gly Asn Glu Val Tyr lie Asp . Arg Phe Glu Leu lie Pro Val Thr 

Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala Val 
610 615- 620 

Asn Ala Leu Phe Thr Ser lie Asn Gin He Gly He Lys Thr Asp Val 
625 630 635 

Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser 



645 

Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys 



660 

His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 



675 



680 685 



Phe Lys Gly He Asn Arg Gin Leu Asp Arg Gly Trp Arg Gly Ser Thr 
" 695 700 



690 



Asp He Thr lie Gin Arg Gly Asp Asp Val Phe Lys Glu Asn Tyr- Val 
705 - 7 1° 715 - 

Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Oln- 
725 730 

Lys lie Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 
740 745 

Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg Tyr 
755 760 ' 03 

Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp 
„ „ ^ 775 / o u 



770 



Pro Leu Ser Ala Gin. Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg 
785 7 90 795 — 

Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg 
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f 



805 810 .v ; 8p 



20 



60 



Asp Gly GluLysCys Ala His His Ser His His Phe Ser ^eu Asp lie 



820 



Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu^^'^Val J/g; 
■,- . 835 . : " 840; . . ^f»,,^,. .. ^. .v;... v . y 

Phe Lystrie Lys Thr Gin Asp Gly His Ala Ar,g Le^G|y.A:sn Le^a Glu, h 

: 850 ,• • ' ■ 855 . ■: : . ' 860 • . . -.^ . • .. 

Phe Leu' Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg^ValUys 

870 875 



15 Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp Glu 



885 



Thr Asn lie Val Tyr Lys Glu Ala Lys Glu Ser Val Asp A^a Leu Phe 
900 905 ylU 

Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn lie Ala Met 
915 920 92b 

He His Ala Ala Asp Lys Arg Val His Ser lie Arg Glu Ala Tyr Leu 
25 930 935 940 

Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala lie Phe Glu Glu 
945 950 9bb 

30 Leu Glu Gly Arg lie Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 

965 970 

Val lie Lys Asn Gly -Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 985 
35 Lys Gly His Val Asp Val Glu OluOln Asn Asn Gin ArgSer Val Leu 

Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu Val Arg Val Cys 
40 1010 ioi5 1020 

Pro Qly Arg Gly Tyr Il^Leu Arg Val Thr Al^Tyr Lys Glu Gly Ty^ 

45 Gly Glu Gly Cys Val Thr lie His Glu lie Glu Asn Asn Thr Asp Glu 

1045 10b0 

Leu Lys Phe Ser Asn Cys Val Glu Glu Glu lie Tyr Pro Asn Asn Thr 
1060 1065 1°'° 

50 Val Thr Cys Asn Asp Tyr Thr Val Asn Gin Glu Glu Tyr Gly Gly Ala 
1075 1080 1085 

Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser Val Pro Ala 
55 1090 ~ 1095 1100 

Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr' Asp Gly Arg Arg 
1105 IHO 1115 



Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr Arg Asp Tyr Thr Pro Leu 
1125 ll 30 - 
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Pro Val Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro GlU Thr, Asp, 
1140 1145 . , 

Lys Val Trp lie Glu lie Gly Glu Thr Glu Gly Thr > Phe, lie Val 
1155 1160 1165 .■ 

Ser Val Glu Leu Leu Leu Met Glu Glu \ 
1170 H75 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
• (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



TATCCAATTC GAACGTCATC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



TTTAGTCATC GATTAAATCA 



(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) , TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATAATAAGAG CTCCAATGTT 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TACATCGTAG TGCAACTCTT 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : ^ : 
' (A) LENGTH: 20 base pairs 

(B) TYPE i nucleic acid; 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCiE ©ESCRrRTIOti: SEQ'<lb tiO:|9-: 
TCATGGAGAG CTCCTATGTT 

(2) INFORMATION FOR SEQ ID NO:20r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : . 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTAACAAGAG CTCCTATGTT 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pa^irs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ACTACCAGGT ACCTTTGATG 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
ACTACCGGGT ACCTTTGATA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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1 



18 



GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GIT GAT ATA ATA" 
Glu Phe Val Pro Gly Ala Gly Phe Vel Leu Gly Leu Val Asp lie lie 
50 • 55 60 

50 TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 

Trp Gly lie Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie 
65 70 75 80 

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 
55 Glu Gin Leu He Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala 

85 9° . y5 _ 

ATT TCT AGA TTA GAA GGA CTA AGC AAT TTT TAT CAA ATT TAC GCA GAA 
ill Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin lie Tyr Ala. Glu 
* ^ 1 nR ill) 



19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ATTTGAGTAA TACTATGC 

5 . . -"J/.: . \ 

(2) INFORMATION. FOR SEQ ID NO :. 24.: \. • - v , y- : 

(i) SEQUENCE CHARACTERISTICS: ^ 
10 (A) LENGTH: 19 base pairs - ' / r , . . . , > V \ : 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATTACTCAAA TACCATTGG 

20 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3534 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
30 (B) LOCATION: 1..3531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 
35 Met tsl ^n £n Pro Asn lie Asn Glu Cys lie Pro Tyr Asn Cys Leu 
1 5 10 lb 

ACT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 
40 ~~ 20 25 30 

TAC ACC CCA ATC GAT ATT TCC TTG TCG CTA ACG CAA TTT CTT TTG AGT 
Tyr Thr Pro He Asp lie Ser Leu Ser Leu Thr Gin Phe Leu LeTT Ser 
35 . 40 45 . . . 

45 „ mmm ^n-^ mm a nn T\ nrn& Cl'W a AT ATA ATA" 192 



48 



96 



144 



240 



288 



336 
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TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT , AAT CCA iQCA . TTA AGA GAA 
Ser Phe Arg Glu Trp Glu Ala ; Asp Pro Thr Asn Prd .|la Arg Glu 

115 120 125 ' 

GAG ATG CGT ATT CAA TTC AAT GAC . ATG AAC AGT GCC CTT AGA ACC GC J . 
Glu Met Arg lie Gin PhCAsri Asp Met Asn' Sefc JAla^ - Leu Thr Thr . Alai 
130 135^ *- ; ■ - IW: 

ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT' CTT TTA TCA GTA 
lie Pro Leu Phe Ala Val Gin Asn Tyr Girt Val- Pro Leu Leu Ser; Va% 

145 ■ - i5X) - V " ■^•'■'155^. • - ^ 

TAT GTT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 
Tvr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser 
165 - 170 175 

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 
Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr lie Asn Ser Arg 
180 ~ 185 190 

TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT AC A GAT CAT GCT GTA 
Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn Tyr Thr Asp His Ala Val 
195 200 205 

CGC TGG TAC AAT ACG GGA TTA GAG CGT GTA TGG GGA CCG GAT TCT AGA 
Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

GAT TGG ATA AGA TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 
Asp Trp He Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 235 240 

TTA GAT ATC GTT TCT GTA TTT CCG AAC TAT GAT AGT AGA ACG TAT CCA 
Leu Asp He Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro 
245 250 255 

ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 
He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
260 265 270 

TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 
Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 — 

AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 
Arg Ser lie Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 
290 " 295 300 

ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 
He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 
305 ~ 310 315 320 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 
He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 330 335 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT 
Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg He Val Ala 
340 345 350 m 

CAA CTA GGT CAG GGC GTG ,TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 
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20 



25 



40 



45 



J 



Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu ,Tyr Arg 
355 360 • 365 

AGA CCTTTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTO GAC 
Arg Pro Phe Asn lie Gly lie Asn Asn Gin Gin Leu Ser. Val Leu Asp.. 
370 375 ; 3 «° . ' ,:, . 



GTT TCA ATG TTT CGT TCA GGC TTT ACT AAT AGT AGT GTA AGT ATA ATA 
Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser He lie 
435 440 

AGA GCT CCA ATG TTT TCT TGG ACG CAC CGT AGT GCA ACC CCT ACA AAT 
Arg Ala Pro Met Phe Ser Trp Thr His Arg Ser Ala Thr Pro Thr Asn 



450 



455 



11-52 



1-200 



GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TC£ GCT GTA 
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu. Pro Ser Ala Val 

10 385 ■ 390 ,395- ; .' . ■ ■ f *^ ... 

TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 1248 
Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 410 - 

15 AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 1296 

En En En Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser Hrs 
420 425 



1344 



13 92 



1440 



1488 



ACA ATT GAT CCG GAG AGG ATT ACT CAA ATA CCA TTG GTA AAA GCA CAT 
III Asp Pro Glu Arg He Thr Gin lie Pro Leu Val Lys Ala His 
30 465 ~ 4 ™ 475 

ACA CTT CAG TCA GGT ACT ACT GTT GTA AGA GGG CCC GGG TTT ACG GGA 
tS Leu ctn Ser Gly Thr Thr Val Val Arg Gly Pro Gly Phe Thr Gly 
485 490 4y 

35 GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA TTT GCT TAT ACT ATT 153 6 

Gly Asp lie Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Tyr Thr He 
500 505 

GTT AAT ATA AAT GGG CAA TTA CCC CAA AGG TAT CGT GCA AGA ATA CGC 
SS En He En Sy Gin Leu Pro Gin Arg Tyr Arg Ala Arg lie Arg 
515 520 525 " 

TAT GCC TCT ACT ACA AAT CTA AGA ATT TAC GTA ACG GTT GCA GGT. GAA 
Sr S Ser Sr Thr Asn Leu Arg He Tyr Val Thr Val Ala Gly Glu_ 
530 535 540 - 

CGG ATT TTT GCT GGT CAA TTT AAC AAA ACA ATG GAT ACC GGT GAC CCA 
Arg ill HI Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 
50 545 550 555 

TTA ACA TTC CAA TCT TTT AGT TAC GCA ACT ATT AAT ACA GCT TTT ACA 
leu Sr Phe oTn Ser Phe Ser Tyr Ala Thr He Asn Thr Ala Phe Thr 
" 565 570 575 

55 TTC CCA ATG AGC CAG AGT AGT TTC ACA GTA GGT GCT GAT ACT TTT AGT 177 6 

Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser 
580 585 590 

60 TCA GGG AAT GAA GTT TAT ATA GAC AGA TTT GAA TTG ATT CCA GTT ACT 1824 

Ser Gly Asn Glu Val Tyr He Asp Arg Phe Glu Leu He Pro Val Thr 



1584" 



1632 



1680 



1728 
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595 600 605 . , 

GCA ACA TTT GAA GCA GAA TAT GAT TTA GAA AGA GCA CA$ AAG GCG GTG 
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Glri Lys. Ala Val 

610 615 620 . ; .-/-y^-' 

AAT GGG CTG TTT ACT TCT ATA AAC CAA ATA GGG ATA AAA ACA GAT ' GTG , 
Asn Ala Leu Phe Thr Ser He Asn Gin He Gly Tie Lys ^Thr Asp ^1 

625 630 635 ' / ' - . *' \ 

ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA 'TGiA 
Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser 
645 650 655 

GAT GAA , TTT TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA 
Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys 
660 665 670 

CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC 
His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
675 680 685 

TTC AAA GGC ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG 
Phe Lys Gly He Asn Arg Gin Leu Asp Arg Gly Trp Arg Gly Ser Thr 
690 695 700 

GAT ATT ACC ATC CAA AGA GGA GAT GAC GTA TTC AAA GAA AAT TAT GTC 
Asp He Thr He Gin Arg Gly Asp Asp Val Phe Lys Glu Asn Tyr Val 
705 710 715 720 

ACA CTA CCA GGT ACC TTT GAT GAG TGC TAT CCA ACA TAT TTG TAT CAA 
Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 
725 - 730 735 

AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT ACC CGT TAT CAA TTA AGA 
Lys He Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 
740 745 750 

GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ATT CGC TAC 
Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg Tyr 
755 760 765 

AAT GCA AAA CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TTA~TGG 
Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu. Trp 
770 775 780 

CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT GGA GAG CCG AAT CGA 
Pro Leu Ser Ala Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg 
785 790 795 800 

TGC GCG CCA CAC CTT GAA TGG AAT CCT GAC TTA GAT TGT TCG TGT AGG 
Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg 
805 810 815 

GAT GGA GAA AAG TGT GCC CAT CAT TCG CAT CAT TTC TCC TTA GAC ATT 
Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp He 
820 825 1 830 

GAT GTA GGA TGT ACA GAC TTA AAT GAG GAC CTA GGT GTA TGG GTG^ATC 
Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val He 
83 5 840 845 
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TTT AAG ATT AAG ACG GAA GAT GGG CAC GCA AGA CTA. GGG AAT » 1 ' 2 5 9 2 -/ 

£ys ne £ys Thr Gin Asp Gly His Ala Arg Lju Gly Asn Leu Glu 



2640 



#688 



2736 



2784 



283 2 



TTT CTC GAA GAG AAA CCA TTA GTA GGA GAA GCG CTA GCT CGT GTG AAA 
£e Leu Glu Glu Lys **o Leu Val Gly Glu Ala Leu Ala Arg Val Lys 

865 ■ 870 ; : • : u..;; • ; ; 

10 APA GCG GAG AAA AAA TGG AGA GAC AAA CGT GAA AAA TTG.GAA TGG GAA 

10 Ar.g Ala tu Lys Lys Trp Arg Asp, Lys Arg- Glu Lys ,.L fe u,Glu £p Glu. 

885 yyu 

ACA AAT ATC GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT 
15 - S A^n iS Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe 

900 

GTA AAC TCT CAA TAT GAT CAA TTA CAA GCG GAT ACG AAT ATT GCC ATG 
SS A^n Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr Asn lie Ala Met 
20 915 920 925 

ATT CAT GCG GCA GAT AAA CGT GTT CAT AGC ATT CGA GAA GCT TAT CTG 
Jle lis tit Ala Asp Lys Arg Val His Ser lie Arg Glu Ala Tyr Leu 
930 935 940 

25 CCT GAG CTG TCT GTG ATT CCG GGT GTC AAT GCG GCT ATT TTT GAA GAA 2880 

• pro Su Leu Ser Val He Pro Gly Val Asn Ala Ala He Phe Glu Glu 
945 950 955 

30 TTA GAA GGG CGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA AAT 

Leu Glu Gly Arg lie Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 

GTC ATT AAA AAT GGT GAT TTT AAT AAT GGC TTA TCC TGC TGG AAC GTG 
35 Sal lie Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 

980 985 

AAA GGG CAT GTA GAT GTA GAA GAA CAA AAC AAC CAA CGT TCG GTC CTT 
^ Sy Ss Zl Lp Val Glu Glu Gin Asn Asn Gin Arg Ser Val Leu 
40 995 1000 1005 

GTT GTT CCG GAA TGG GAA GCA GAA GTG TCA CAA GAA GTT CGT GTC TGT 
511 Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu Val Arg Va^Cys 
1010 < 1015 I 020 - - - 

45 CCG GGT CGT GGC TAT ATC CTT CGT GTC ACA GCG TAC AAG GAG GGA TAT" 

Pro Gly Arg Gly Tyr lie Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr 
- 1025 1030 1035 

50 GGA GAA GGT TGC GTA ACC ATT CAT GAG ATC GAG AAC AAT ACA GAC GAA 

Gly Glu Gly Cys Val Thr lie His Glu He Glu Asn Asn Thr Asp Glu 
1045 1° 50 

CTG AAG TTT AGC AAC TGC GTA GAA GAG GAA ATC TAT CCA AAT AAC ACG 
55 "u £ys III Ser Q Asn Cys Val Glu Gl^Glu lie Tyr Pro AsnAsn Thr 

' GTA ACG TGT AAT GAT TAT ACT GTA AAT CAA GAA GAA TAC GGA GGT GCG 
Val iS Cys Asn Asp Tyr Thr Val Asn Gin Glu Glu Tyr Gly Gly. Ala 
60 1075 1080 1085 



2928 - 



2976 



3024 



3072 



3168 



3216 



3264 
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m ■ rrA TAT aac GAA GOT. CCT TCC GT?h CQA GCT - 

TAG ACT TCT CGT AAT CG A GG* TAT AAC ^ Ala Pro Ser V.&l Pro, Ala 

Tyr Thr Ser Arg Asn Arg c^iy iy 11Q0 

1090 luyD 

™ a rzvA aaa tcg tat/aca g&t g6a ' g$k , aga; 

1105 1110 , . , 

- m UP apa PRG TAT AGG GAT TAC ACG CCA..CTA 
GAG AAT CCT TGT GAA TTT AAC Ag GGG TAT £ ^-fcj* p*6^ 

Glu Asn Pro Cys Glu Phe Asn Arg ^ay v » ... ; 1135 

1125 iUU 

S S 5 5 3 S S Si SS ™ = S §S S. 

1140 114b 

- ° T t r s ss S S Si £ - S £ s S S Sp 

Lys Val Trp lie Glu lie fc.xy ^ g5 
1155 1160 

AGO GTG GAA TTA CTC CTT ATG GAG GAA TAG 
Ser Val Glu Leu Leu Leu Met Glu Glu 

1170 1175 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
Met Asp Asn Asn Pro Asn lie Asn Glu Cys lie Pro Tyr Asn Cys Leu 

Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 

20 2b 

Tyr Thr Pro He Asp He Ser Leu Ser Leu Thr Gin Phe Leu Le^Ser 

35 - 40 " 

Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp lie He" 

50 55 
Trp Gly lie Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin II. 

Glu Gin Leu lie Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala 

85 9U 

lie Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin He Tyr Ala Glu 
100 10b 

Ser Ph. «« Glu Trp Glu »1. «P Pro Thr Asn Pro Jl. L~ W«» 
115 1ZU 
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1 



35 



135 140 

130 1 . . 



50 



He Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Val 
145 150 

Ty, val Gin Ma Ala **«M Bi * ^ K ^ ^ ' 

• 165 ••; 1/u ; .;. > 

v.i Phe G» Gin *>P TrpGlv ». Al, * , 

180 ' ' 18:5 . * ; , . : L - , ^.^ C:.. . 

». , Ton Thr Arg Leu lie Gly Asn Tyr Thr Asp His Ala Val 

Tyr Asn Asp Leu inr »ta 205 



195 200 
15 • Ar 9 Trp Tyr Asn Thr Gly Leu Glu Ar g ,Val Trp Gly Pro Asp Ser. Arg. 
210 ^ 



Asp Trp lie Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 
20 Leu Asp He Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro 



245 250 



He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 



25 260 265 

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly lie Glu 
275 280 
30 Arg Ser He Arg Ser Pro His Leu Met Asp He Leu Asn Ser lie Thr 
290 2yb 

» , 1a Ara Glv Tyr Tyr Tyr Trp Ser Gly His Gin 

lie Tyr Thr Asp Ala His Arg ^±y iyi *x x * 32Q 

305 310 315 

lie Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 330 

L eu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg lie Val Ala 
40 340 345 

Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr_Arg 
355 360 
45 Arg Pro Phe Asn He Gly He Asn Asn Gin Gin Leu Ser . Val Leu Asp 
370 375 



Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 
385 390 

Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 



405 



Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
55 420 425 

Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser 'Val Ser lie He 



435 



440 



60 Ar g Al. Pro Met Phe Ser Trp Thr His Are Ser Al, Thr Pro Thr Asn 
450 4 " ^ * 
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Thr lie Asp Pro Glu Arg lie Thr Gin lie Pro Leu Val Lys Al^'Has- 
465 47G 475 * '"' -48.0 

Thr Leu Gin Ser Gly Thr Thr Val Val Arg Gly ;.. Pro- Gly #he Thr Gly. 

• '4 : 85' ' "%9>0 . .* ;y^>y ^495^ 

Gly Asp lie Leu Arg Arg Thr Ser Gly Gly Pro -Phe Ala Tyr Thr lie 

• 500' v 505 ' '/ f / • f ^51^>;., ' / 

Val Asn lie Asn Gly Gin Leu Pro \Gln Arg ,t^-M^^^%^^^.C^e^g^ 
515 520 525; . * 

Tyr Ala Ser Thr Thr Asn Leu Arg lie Tyr Val Thr Val Ala Gly Glu 
530 535 . 540 - 

Arg lie Phe Ala Gly Gin Phe Asn Lys Thr Met Asp Thr Gly Asp Pro 
545 550 555 560 

Leu Thr Phe Gin Ser Phe Ser Tyr Ala Thr lie Asn Thr Ala Phe Thr 
565 570 575 

Phe Pro Met Ser Gin Ser Ser Phe Thr Val Gly Ala Asp Thr Phe Ser, 
580 585 590 

Ser Gly Asn Glu Val Tyr lie Asp Arg Phe Glu Leu lie Pro Val Thr 
595 600 605 

Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala Val 
610 615 620 

Asn Ala Leu Phe Thr Ser lie Asn Gin lie Gly He Lys Thr Asp Val 
625 630 635 640 

Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser 
645 650 ■ 655 

Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys 
660 665 670 

His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
675 680 685 

Phe Lys Gly lie Asn Arg Gin Leu Asp Arg Gly Trp Arg Gly Ser Thr 
690 ^ 695 700 

Asp He Thr He Gin Arg Gly Asp Asp Val Phe Lys Glu Asn Tyr. Val 
705 710 715~ 720 

Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin 
725 730 735 

Lys lie Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gin Leu Arg 
740 745 750 

Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg Tyr 
755 760 & 765 



Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu m Trp 
770 ' 775 780 
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p ro heu Ser Al. Gin Ser Pro lie Oly Lys Cys Gly Glu Pro Ash, Ag 
^ Al, Pro His -leu Glu Trp Asn Pro Asp, U~ Asp Cys Ser «S ; A*#, 
Asp Gly Glu LysCys Al, His Hi, Sor His Hi, «?, £| Asp 1., 

8 2o ^ ; ;■ * ' • > - 

Asp V.1 Gly Cys Thr Asp ,,u Asn Glu Asp leu GXy Vsl Trp V. i Mf ; 
835 ** 4U 

wie Ala Ara Leu Gly Asn Leu Glu 
Phe Lys He Lys Thr Gin Asp Gly His Ala Arg 

850 8bb 

t^i Glv Glu. Ala ■ Leu Ala Arg Val Lys 

Phe Leu Glu Glu Lys Pro Leu Val Gly biu *±<* QQQ 

865 870 

ftwr aqh Tvs Ara Glu Lys Leu Glu Trp Glu 
Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg w.u y ^ 

8 85 



Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe 

900 90b 
Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala Asp Thr A» He Ala Met 



915 



920. 



n. His Al. Al. Asp Lys Arg V.l His Ser Us Ar ? Glu Al. Tyr Leu 

p„ Glu Leu ssr V.l Us Pro Gly V.l Asn Al, Al, He Phs Glu Glu 
945 . 950 

L eu Glu Gly Arg He " Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 

965 9/U 
Val lie Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val 
980 yab 

Lys Oly Hi, V.l Asp V.l Glu Glu Gin Asn Asn Gin Jr. 8- V.l Leu 
995 xuuu 

V.! v.! Pro Glu Trp Glu Al, Glu V.l ssr Gin «. «1 tt. ^ Cys 

1010 101b - - " 

Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tj* 

1 n "3 A J. U O J 



1025 1° 30 



„ i rpv^ -n- His Glu He Glu Asn Asn Thr Asp Glu 
Gly Glu Gly Cys Val Thr He His biu u 1Q55 

1045 1050 

Leu Lys Phe Ser Asn Cys Val Glu Glu Glu He Tyr Pro AsnAsn Thr 

1060 1065 
V.1 Thr Cys Asn Asp Tyr Thr v.1 Asn Gin Glu Glu Tyr Gly Gly Al, 

1075 108U 
Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala ProSer Val Pro Ala 

1090 1095 ' 

Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg 
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ins ; <ll2 ° 

1105 1110 



Glu Asn Pro Cvs OlU 5 P h e A sn Ar 9 C jrArg ^^^^^ 
Pro val «y ^ Glu S^^*^ 

'1140 1143 . ,'? ■ v.: f. : : 

, T1 „ Glv Glu Thr Glu Gly Thr Phe;.|le .Val Asp... 

Lys Val Trp He Glu lie Gly Glu_in H65 . 

, 1155 1160 : " , ' • ■- • ' 

Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 11/3 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 3534 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..3531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

S S S2 £ S 25 S S 5 S S ^ S £ = 



i 5 



= s s: s a s s 1 s e s s a s s 

s S S S 5 s = = = s s ss S S = = 

£ S E S 5 S 5 - C = S 2 S S SES 

50 55 

S S S S S S 5 SK S 25 S SE S 5 ST 2 

85 5 

s s s ™ s s = £ s = T £ a s i s s 

100 10b 

_ n p^rr, rrT A CT AAT CCA GCA TTA AGA. GAA 

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT ^ ^ ^ 

Ser Phe Arg Glu Trp Glu Ala Asp Fro ^ 
115 
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1 



J 



' 432_/~ '. 

135 i*u ;> .,.v ; :vvv 



0,0 ™ oo T * gj - « s - - s s * :?£ £ s 

Glu Met Arg He Gin Phe Asn «. y u() 

130 135 ,v ... • .; 

5 ■ «■ **t tat CAA GTT : CCT CTT TTA TCA GTA ' 

u s = s s s s s = s s ss s ss s s a 

180 lbD 

TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC « Thr X E S S 
Tyr Asn Asp Leu Thr Arg Leu lie Oly Asn Ty ^ 

20 * 195 

n-n» r-AA CGT GTA TGG GGA CCG GAT TCT AGA 

s s E s s s - £ ~ ™ «j « -p - «, 

25 OAT l OTA AGO TAT AAT CAA TTT -» «£ 2 g £ 52 

Asp Trp Val Arg Tyr Asn Gin Phe Arg 9 ^ 240 
225 

„ rrr T AT GAT AGT AGA AGA TAT CCA 

30 TTA OAT ATC GTJ OCT CTG TTC « AAT a ^ ^ ^ ^ 

Leu Asp He vai ^ ^ 250 
245 

ATT COA ACA GTT TCC CAA TTA ACA ACA GAA ATT TAT £ AAC CCA OTA 
35 He Arg Thr Val Ser Gin Leu Thr Arg 27Q 

m „_ nnr TCG GCT CAG GGC ATA GAA 
TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG G^ ^ ^ ^ ^ 
Leu Glu Asn Phe Asp Gly Ser Phe Arg y ^ 
40 275 280 

„ Tirrv^ pat ATA CTT AAC AGT ATA ACC 

AOA AGT ATT AGO AGT CCA CAT TTG ATO OAT ATA ^ ^ ^ 
Arg Ser He Arg Ser Pro His 3Q0 _ 

290 ' ^ 

45 * nr rCT TAT TAT TAT TGG TCA GGG CAT CAA" 

ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT ^ ^ fl 

He Tyr Thr Asp Ala His Arg Gly Tyr ryr ^ 32Q 



305 310 



50 - s s = s s s = = i s s = s i s 

» --5SSSSSSS2S.SSES 

340 34b 

ss s s s s ^ s s = I = £ = « 



340 345 
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528 



576 



624 



672 



720 



816 



864 



912 



960 



1008 



1056 



1104 



25 



30 



35 



AGA CCT TTT AAT ATA GGG AT* gT AA* ^ |* 
Arg' Pro Phe Asn He Gly lie -., jgd" 

■ 370 J • . ... . . >v.:«:V v : '■' '■■■,<• '■■ 

5 « Rrr TTC TCA AAT TTG CCA ..TCC GCT GTA'. 

GGG ACA GAA TTT GCT TAT GGA ACC T|C TC A Q _ Serbia Val 

Giy Thr Glu Phe Ala Tyr Gly Thr Ser Ser . ^ ' 4.0.0 

385". 390 ■ . • • ?<; :.^ v ... .• ".■^-:'^||'> 

» ^ si gs s « s ■ s ss s s a § 

405 

„ /-^ * * rr ft TTT AGT CAT CGA TTA AGC CAT 
AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT ^ 
1«5 Asn Asn Asn Val Pro Pro Arg Gin Gly Phe 43Q 
420 

GTT » .TO TTT COT TO, O f TTT « » AAT £ £ « £ £ £ 
Val Ser Met Phe Arg Ser Gly Phe ser ^ 

AOi OCT CCT .TO TTC TCT TOO ATA CAT COT £ £ £ - J- £ £, 
Arg Ala Pro Met Phe Ser Trp lie n ^ 
450 

™ A mm fi GTA AAA GCA CAT 

s s a = s = s s sa s s = - . - 

465 470 

„- rpr pnn TTT ACG GGA 
«X CTT CAO TCA COT ACT ACT CTT OTA ACA JOO « ^ ^ ^ 
Thr Leu Gin Ser Gly Thr Thr vaj. ^ 495 

»om r-oa CCA CCA TTT GCT TAT ACT ATT ... 
GGA GAT ATT CTT CGA CGA ACA AGT GGA GGA CCA T^ ^ ^ ^ ^ 
Gly Asp He Leu Arg Arg Thr Ser Gly <, y 51Q 
500 

C S S £ 5 S = i SS S £ 5 I S S S 

15 

5 E S S £ S 2 5 S SS- S | S S SS 
45 5 s = S 5 S = S - S 1 5 S 5 S K 

545 

» = s s a s = = ss s s s s s s s s 
55 « s s ss SK = s ss s s s s s S E = 

580 b 

s s s s - s s s s a = I s s s 



ii52_ 



4 



60 " »M 
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1296 



13 44 



1536 



1584 



1776 



«» ACA TTT GAA OCA GM TAT « ™ ™ KSS^ ' ' " ^ 

Ala Thr Phe Glu Ala Glu Tyr asp l, 

5 AAT GCG CTG TTT ,CT TCT- AAC CAA ATA |g ^ AAA |g gM^I^ 
Asn Al« tS. Phe Tht S«.Ile Asn Sin «• |* .,-.M0.-,.' u; • 
625 ^ : 

ota TCC AAT TTA GTG GAT TGT TTA TCA 1?^ 
ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT T Asp C ys Leu Ser 

10 Thr Asp Tyr His lie Asp Gin Val Ser Asn. . . . _ , ^j,,^., 

645 

» »»r rrA GAA TTG TCC GAG AAA GTC AAA 2016 
GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG ^ ^ ^ ^ Lys 
Asp Glu Phe Cys Leu Asp Glu Lys Arg 670 
ic 660 onCA 

rrr ART TTA CTT CAA GAT CCA AAC . 2064 
CAT GCG AAG CGA CTC ACT GAT GAG CGG AAT TTA ^ ^ ^ ^ 

His Ala Lys Arg Leu Ser Asp Glu Arg as ^ 

20 TTC AAA Z ATC AAT AGG CAA CTA GAC CGT GGT TGG AGA GGA AGT ACG ™ 

Phe Lys Gly He Asn Arg Gin Leu Asp Arg y 

690 695 216C 

» s s s s a s s s s s i - - - " » 

30 S a S 5 i = S K S i £ S S = i » 22 ° ! 

rrr TTT ACC CGT TAT CAA TTA AGA 225 
AAA ATC GAT GAA TCA AAA TTA AAA GCC TTT A^ ^ ^ ^ Arg 

Lys He Asp Glu Ser Lys Leu Lys Ala v ?50 

35 740 ->-*n 

™a a-pr TAT TTA ATT CGC TAC 
GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC TAT TTA ^ ^ ^ 
Gly Tyr He Glu Asp Ser Gin Asp Leu Ui ^ 

40 AAT GCA Z CAT GAA ACA GTA AAT GTG CCA GGT ACG GGT TCC TT^ TGG 

Asn Ala Lys His Glu Thr Val Asn Val 780 ~ 

770 " 5 . - - 24( 

« s = s s ss = s s s s i s a s jsi- 
50 ~ s ss s s s s £ s s = = - s i 5 

GftT TO M «o 1 occ ™ His E SI SI s s s » s 

Asp Gly Glu Lys Cys Ala His His ^ Q3Q 

55 ^ 20 

GAT GTA GGA TGT ACA GAC TTA AAT GAG GAG GTA TGG GTG ATC 

Asp Val Gly Cys Thr Asp Leu Asn ta x * g45 
835 04U 

^ aap ACG CAA GAT GGG CAC GCA AGA CTA GGG AAT CTA GAG 
TTT AAG ATT AAG ACG CAA ^ 1 ^ 



2496 



2544 
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Phe tys He Lys Thr Gin Asp Gly His Ala Arg Leu Gly Asn Leu. Glu 
850 855 

2 2 2 S* 3 = .« 5*3 ? SS££. 

s £ s s shla a 5 a a a s'§ a a 

900 905 

s se s: s a = as s-sra s s s 



s s s s s s: s s s s s a s 5 = 

930 93S 



s a s s s s s s s a s s s = a a 

945 950 955 

-n» naa rrr CGT ATT TTC ACT GCA TTC TCC CTA TAT GAT GCG AGA AAT 

2 Glu gS S S S T,r Ala Phe Ser Leu Tyr Asp Ala Arg Asn 
965 970 

s s 2 S S S K a 2 25 ss a 52 2 2 



995 1000 



s 2 ss s a s a s a ss s 2 

ioio 1015 — 

s s s s 5 s 2 s 2 s sss as s& 

1025 1030 

S Si S 5 SS S Ss Si SS 2 2 S £S 
2 K SI 2 £S S 2 SS S S S S£ S 

1060 10bb 

s s z s a - £ Er ~ s " s. s s s 

1 0 oO 



1075 



SS £ S S US S E 5 - - S S 2 5 £ S 
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2640 



2688 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 



1095 libo 

a™ n-vr TAT GAA GAA AAA TCG TAT ,ACA, GAT GGA CGA AGA 

™ s s.se " s; ss «- « as**** w.sasr 

1105 1110 ,- ■ ■ ; .it . < .V.-v" \ -v.*. 

GAG AAT CCT «*V « f C ^ «g $ £ S l ^ 

Glu Ash Pro cys Glu Phe Asn Arg Gly.Ty| Arg Asp y , 

1140 114t> 

J- S 3 S - S5S S »« S ft ?S 

1155 1160 

AGC GTG GAA TTA CTC CTT ATG GAG GAA TAG 
Ser Val Glu Leu Leu Leu Met Glu Glu 
1170 1175 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
Met Asp Asn Asn Pro Asn lie Asn Glu Cys He Pro Tyr Asn Cys Leu 

1 5 
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 
20 25 

Tyr Thr Pro lie Asp He Ser Leu Ser Leu Thr Gin Phe Leu Leu Ser 

35 40 ^ _ 

Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp lie. lie 



50 



55 



Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin- lie 
65 J 70 75 



Glu Gin Leu lie Asn Gin Arg lie Glu Glu Phe Ala Arg Asn Gin Ala 

85 90 
lie Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin He Tyr Ala Glu 



Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 



115 



120 



lie Gin Phe Asn Asp Met Asn Ser Ala Leu Thr Thr. Ala 



Glu Met Arg He Gin Fne *sn — 

1« 



130 135 
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lie Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Val 
145 150 155 160 

Tyr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu. Arg Asp Val Ser 
165 170 175 

Val Phe Gly Gin Arg Trp Gly Phe. Asp Ala Ala Thr- lie Ash Ser Arg 

180 IBS .' . 190 . 

Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn Tyr Thr. Asp Tyr Ala Val 

195 ' - V 200 ■ ■ ' 205 - > " . 

Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu. Leu Thr Leu Thr Val 
225 230 235 240 

Leu Asp lie Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 
245 250 255 

He Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
260 265 270 

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 

Arg Ser He Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 
290 295 300 

He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 
305 310 315 320 

He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 
325 330 335 

Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg He Val Ala 
340 345 350 

Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg 
355 360 365 

Arg Pro Phe Asn He Gly He Asn Asn Gin Gin Leu Ser Val Lett-Asp 
370 , 3 75 3 80 

Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 
385 390 395 400 

Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 
405 410 415 

Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
420 425 430 

Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser He lie 
435 440 445 

Arg Ala Pro Met Phe Ser Trp He His Arg Ser Ala Glu Phe Asn Asn 
450 455 460 . t 

lie lie Ala Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ala His 
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... 480 
470 475 

465 



, „ , .,:. rl ^, pro Gly Ette Thr Gly 
x Gin Ser Gly Thr Thr Val V«l Arg Gly Pro Gly ^ 

485 : 

50.0 



.515-. 



ti'' v=i Thr Val Ala Gly Glu 

Ala Ser Thr Thr Asn Leu Arg He Tyr Val Thr 

_ ln phe Asn Lys Thr Met Asp Thr . Gly Asp Pro . 
He Phe Ala Gly Gin Pne Asn uy* 555 560 



Arg ~- 55Q 

Z * - «- - s« *r M. ^ "« » - «a £ ». 
565 

Phe Pto « f « «- « - ~ SI val Gly Ala " p ss phe sel 

seI Gly « I v. ^ ne „ « - - - - ™ ~ 

595 ° 

i at a Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala Val 
Ala Thr Phe Glu Ala Glu Tyr a p 62Q 

ri a riw Tie Lvs Thr Asp vai 
Asn Ala Leu Phe Thr Ser He Asn Gin He Gly 

625 

, u i c QY - Aqn Leu Val Asp Cys Leu Ser 
Th r Asp Tyr His lie Asp Gin Val Ser Asn ^ 
645 

ftsp oi. - c fo «. « - - « «*■ ~ s - slu ss val Lys 
His „, W s "1 - « « ss - - - - ss « ~ Asn 
Phe w . X n« » « «. - - « «* ?o? " 3 Gly s - *• 

„ r. «. Gl „ «. «» « « - ^ «. «» - 

Z L» « -V tg «- «P «" TVJ Pro T, r ^ «- «- 

Lys ne *sp 01. I- W - , y s «. «r «. Tyr «£ 
740 ' 

,*-«!«: Gin Asp Leu Glu He Tyr Leu He Arg Tyr 
Gly Tyr He Glu Asp Ser Gin Asp be ?65 

755 



U . Ly5 HXs «» - »J - « « S»' Gly ~ 

770 

Leu ser Ala Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg 



Pro ~ 

785 
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• t , Glu Trp Asn Pro Asp ,Leu Asp Cys Ser Cys *rg : . • 
Cys Ala Pro His Leu Glu Trp * gio . 81 - 

80.5 ' <■« v: 

Mp Gly - <:,- - «s - fe - - ~*M7M- I 

820 V " ; : : v- 

Kp Val sly Cys Thr * W & ** 1^S# 

^ Z OXU OX- *- - Leu Val CXy «U - - *» V. Jg 

15 Lvs Arg Glu Lys Leu Glu Trp Glu 

Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg V 895 

88b 

.*30F o1 Lvs Giu ser Val Asp Ala Leu Phe 

, Tlo Va i Tvr Lys Glu Axa Lys v>j.u * 
20 Thr Asn He val ry* J g05 ?xv 

900 

„. IO „ rln A ia Asp Thr Asn He Ala Met 
Val Asn Ser Gin Tyr Asp Gin Leu Gin , P 

25 He His "a Ala Asp Lys Arg Val His Ser Xle Arg Glu Ala *r Leu 

Pro Clu Leu Ser Val Xle Pro Gly Val Asn Ala Ala Xle Phe Glu Glu 

30 Teu Glu Gly A, Xle Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn 

35 Val Xle Lys Asn Gly Asp Pne Asn Asn Gly Leu Ser Cys Trp Asn Val 

Lys G ly His Val ASP Val Glu G.Gln Asn Asn Gin Arg^er Val Leu 

40 val Val Z Glu Trp Glu A.Glu Val Ser Gin G.Val Arg ValCys 

Pro Gly Tyr X.Leu Arg Val Thr Al^Tyr Lys Gl, ^ 

45 rGlu Gly cys Val 5 T h r Xle His Glu X.Glu Asn Asn Thr As.Glu 

50 Leu Lys Pne S.Asn Cys Val Glu G.Clu Xle Tyr Pro A.Asn Thr 

' val Thr Cys Asn Asp Tyr Thr Val Asn Gin Gl. Glu Ty^Gly Gly Ala 
1075 

in Ala Pro Ser Val Pro Ala 



55 



Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Gl 
1090 1 



ftsp Tyr Ser ».l Tyr <"» 01. W - 
60 H05 



Glu Lys Ser Tyr^Thr Asp Gly Affl.^ 
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Glu « ™ c - - - h 

1125 

wi w'PhP pro Glu Thr Asp 
P„ V,! M» Tvr V.1 Thr Lys «» "-J**** ^ U ? 0 ^ , 

1140 • 4 ' >' . - •: . "V. . ; •:• .... . " , 

, Glu-Thr Glu Gly Thr Phe -ile vkl Asp ../ 

Lys Val Trp He Glu lie Gly Glu^hr 1165 : • .. • 

. 1155 . ■;• v •■ ••; ; : 

Ser Val Glu Leu Leu Leu "Met .Glu Glu • ^ ■ " ,\ ^ ,. ......J 



50 



55 



1170 



(2) INFORMATION FOR SEQ ID NO :29: 

( i 1 * SEQUENCE CHARACTERISTICS: 
( (A) LENGTH: 3579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ixi FEATURE: 
UX) (A) NAME/KEY: CDS 

(B) LOCATION: 1. .357* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

= s s a s s s = 1 s Si S S K s s 
tac acc » « « s s ™ s s s si s 22 s 

Tyr Thr Pro He Asp He ber ^ 45 

GAA TTT Z CCC GGT «T » ™ « ™ & S 32 E SJ5 
Glu Phe Val Pro Gly Ala Gly P»e v gQ 

50 - rr* TTT CTT • GTA CAA ATT 

» S S £ S S S S Si S E S - - v., «- xj. ■ 
« - ™ ;r E Si S S Si Si S S S E Si K 

Glu Gin Leu He Asn Gin Arg i 9b 
85 

» - r s si s s ss s = - - s i s s 

He Ser Arg Leu Glu i*xy u liu 
100 

TCT TTT AGA GAG TGG GAA GCA GAT OCT ACT AAT CCA OCA TTA AGA » 
Ser Phe Arg Glu Trp Glu Ala Asp 1?5 

GAG ATG CGT ATT CAA TTC AAT GAG ATG AAC AGT GCG CTT ACA ACC. GCT 
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Glu Met Arg lie Gin Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala 
130 " 135 140 

ATT CCT CTT TTT GGA GTT GAA AAT TAT CAA GTT CCT CTT TTA TCA GTA 

lie Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Va;l 

145 - 150 : ■ ;'• 5 . y ; .■> • 

TAT GTT CAA GOT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 

Tyr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp c Val 'Ser : 

165 . i7;P\ . , .17 5 

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 

Val Phe Gly Gin Arg Trp Gly Phe Asp Ala Ala Thr lie Asn Ser Arg 
180 185 .190 

TAT AAT GAT TTA ACT AGG CTT ATT GGG AAG TAT ACA GAT TAT GCT GTA 

Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn Tyr Thr Asp Tyr Ala Val 
195 200 205 

CGC TGG TAC AAT ACG GGA TTA GAA CGT GTA TGG GGA CCG GAT TCT AGA 

Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

GAT TGG GTA AGG TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 

Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 235 240 

TTA GAT ATC GTT GCT CTG TTC CCG AAT TAT GAT AGT AGA AGA TAT CCA 

Leu Asp He Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 

245 250 255 

ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 

He Arg Thr Val Ser 'Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 
260 265 270 

TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly He Glu 
275 280 285 

AGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 

Arg Ser lie Arg Ser Pro His Leu Met Asp He Leu Asn Ser He Thr 

290 295 300 _ 

ATC TAT ACG GAT GCT CAT AGG GGT TAT TAT TAT TGG TCA GGG CAT CAA 

He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 
305 310 315 3-2 a 

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 

He Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 

325 330 335 

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT 

Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg lie Val Ala 
340 345 350 

CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 

Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu- Ser Ser Thr Leu Tyr Arg 
340 345 350 

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 

Arg Pro Phe Asn He Gly He Asn Asn Gin Gin- Leu Ser Val Leu Asp 
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370 



375 



380 



„ tCfl « * « - ~ « s S S2 3 S SSS 

Gly Thr Glu Phe Ala Tyr Gly "inr 3 95 .. . 400 

385 39 



GAA ATA CCG CCA CAG 
Glu.Ile'Ero Pro Gin 

. : .;• . ■ 415 - 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT 
Tyr Arg Lys Ser Gly Thr Val Asp ^ . 

„ ' a(A TTT AGT CAT CGA TTA AGC CAT 

js ; s - 1 c f " s s ss I s - - - a s " Hls 

GTT TCA ATG « CO, - « « £ £ S «fi S S 
Val Ser Met Phe Arg Ser Gly Phe * ^ 

AGA OC « ATG «C S - ATA OA, CGT - GGA ACT « Thr Aso 
Arg Ala Pro Met Phe Ser Trp lie 46Q 
450 

* *m* PPT TTA GTG AAA GGA TTT 
ACA ATT GAT CGA GAG AGA ATT AAT GAA ATA OCT « ^ 
Thr He Asp Pro Glu Arg He Asn 475 480 

Hi GTT TGG GGG GGG AGC TOT GTG ATT AGA GGA GGA GGA TTT 
Arg Val Trp Gly Gly Thr ber v 4gQ 49 b 

485 

„ GAT ATG GTT GGA AGA AAT AGC TTT GGT GAT TTT GTA TGT GTA GAA 
Gly Asp He Leu Arg Arg Asn Thr Pne y 51Q 
500 

™* a rA TAC CGT TTA AGA TTT CGT 
GTC AAT ATT . AAT TCA CCA ATT ACC CAA AGA TAC ^ phe Arg 

Val Asn lie Asn Ser Pro He Tnr 525 
515 

ss s s = 5 s s s s s s = s - s 
s s s s s s s s = s s.s; ^ 

545 550 



1200 



1248 



,12916: 



1344 



1392 



1440 



1488 



1536 



SCT ATG GAA ATA GGG GAG AAC TTA AGA TCT AGA AGA TTT AGA TAT AGC 
Thr Met Glu He Gly Glu Asn u ^ 5 75 

GflT TTT AGT AAT CCT TTT TCA TTT AGA GGT AAT CCA GAT ATA ATT « 
Asp Phe Ser Asn Pro Phe Ser Phe Arg 590 

~™ nr-T TPT ATT AGT AGC GGT GAA 
ATA AGT GAA CAA CCT CTA TTT GGT GCA GGT TGT ATT ^ ^ ^ ^ 
He Ser Glu Gin Pro Leu Phe Gly Al 605 
595 

CTT TAT ATA GAT AAA ATT GAA ATT ATT GTA »» Asp S 52 ^ 
Leu Tyr He Asp Lys He Glu He ^ 
610 615 



1584 



1632 



1680 



1728 



1776 



1824 



1872 
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2256 



GCA GAA TCT OH TTA GAA AQA GCA OA AAG og ^ AAT fl£<* *» 
AU Olu Set Asp Leu Glu Arc, Ale. Gin Lys Ala V W 

625 ., ^ y •>•:.; 

5 ACT TCT TCC AAT CAA ATC GOG TTJ AAA Ag : GAT GT, . ^ ^ „ 

Thr Ser Ser Asn Gin He Gly Leu uy 655 
645 •.<•. ."v . ,,. ^ 

' '•■ • „ tvt tta TC A ' GAT- 'GAA TTT'. TGT 

,0 ATT GAT CAA GTA TCC AAT TTA GTG . ; |AT TGT TTA TC£. ^ ^ . . # 

lie Asp Gin Val Ser Asn Leu Val Asp 670 
660 00 

~ »»t> ssi CAT GCG AAG CGA 

CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA ^ ^ ^ 

15 Leu Asp Glu Lys Arg Glu -Leu Ser ^ 685 

CTC AGT Z GAG CGG AAT TTA CTT CAA GAT CCA AAC TTC AGA GGG ATC 
Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp ^ 
20 690 695 

705 710 
25 m/-. a a a rAC AAT TAC GTC ACA CTA CCG GGT 

25 CAA GGA GGA GAT GAC GTA TTC AAA GAG AAT ^ ^ ^ ^ Qly 

Gin Gly Gly Asp Asp Val Phe Lys u 735 
725 

*nr TAT TTA TAT CAG AAA ATA GAT GAG 
30 £ £ S SS S S Si S S S Gin «. 11. «. 

740 /4D 

mTvm nATA TTA AGA GGG TAT ATC GAA 
TCG AAA TTA AAA GCT TAT ACC CGT TAT GAA TTA AG^ ^ ^ ^ ^ 
35 Ser Lys Leu Lys Ala Tyr Thr Arg Tyr ^ 

755 

rrpo rr-T TAC AAT GCA AAA CAC 
GAT AGT CAA GAC TTA GAA ATC TAT TTG ATC GGT TAG AA_ ^ ^ ^ 
Asp Ser Gin Asp Leu Glu lie ryr ?80 
40 770 ' „■ 

arr CGT TCC TTA TGG CCG CTT TCA_GCC 

S5 S SS « ~ S S S S « ^ « « «. - - 

« I AGT CCA ATC GGA AAG TGT GGA GAA CCG AAT CGA TCC GCG CCA CAC ■ 

Gin Ser Pro He Gly Lys Cys Gly &l 815 
805 

«n» rvr TGT TCC TGC AGA GAC GGG GAA AAA 

50 s s: s r„ s s a » s ~ ^ „ « «* «« - 

820 

„ p,t pat TTC ACC TTG GAT ATT GAT GTT GGA TGT 
TGT GCA CAT CAT TCC CAT CAT TTC ACC n , Asp Val Gly Cys 

55 Cys Ala His His Ser His His Phe g45 
83 5 

, C A GAC TTA AAT GAG GAC TTA GGT GTA TGG GTG ATA TTC AAG ATTMO 
Thr Asp Leu Asn Glu Asp Leu Gly v gg0 



2016' 



2064 



2112 



2304 



2496 



60 850 855 
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2544 



2592 



30 



35 



40 



ftCG CAA GAT GCC CAT GCA r CTJ « W « SygSfiSfc 
Thr Gin Asp Gly His Ala, Arg Leu Gly Asn ueu * &m 

865 810 ! .i , : ,t 

'885 . . ' - f ' 

10 Lys Tirp Arg Asp Lys Arg Glu Lys. ^eu ta x . 9 10 ; - 

900 9Ub "T'.*" 

TAT AAA GAG GCA AAA GAA TOT GTA GAT GOT TTA TTT GTA AAC TCT CAA 

Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Pne . 
15 915 920 

TAT GAT AGA TTA CAA GTG GAT ACG AAC ATC GCA ATG ATT CAT GCG GCA 

Tyr Asp Arg Leu Gin Val Asp Thr Asn lie Ala Met. 

20 GAT 2 CGC GTT CAT AGA ATG COG GAA GCG TAT CTG CCA GAG TTG TCT 

Asp Lys Arg Val His Arg He Arg Glu Ala ryr ^ 
945 950 

™ „„,n aat rrc GCC ATT TTC GAA GAA TTA GAG GGA CGT 
25 GTG ATT CCA GGT GTC AAT GCG GCC ATI u Gly Arg 

Val He Pro Gly Val Asn Ala Ala He Phe Glu fa ^ 

965 970 
ATT TTT ACA GCG TAT TCC TTA TAT GAT GCG AGA AAT GTC ATT AAA AAT 
lie Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn va ^ 

980 98b 
CCC GAT TTC AAT AAT GGC TTA TTA TGC TGG AAC GTG AAA GOT CAT GTA 
Gly Asp Phe Asn Asn Gly Leu L.u^Cy. Trp Asn 

CAT GTA Hi GAG CAA AAC AAC CAC GGT TCG GTC CTT GTT ATC CCA GAA 
Asp Val Glu Glu Gin Asn Asn His Arg Ser va ^ 

1010 1015 
TGG GAG GCA GAA GTG TCA CAA GAG GTT CGT GTC TGT CCA GGT CGT GGC 
Trp Glu Ala Glu Val Ser Gin Glu Val Arg Val^Cys __ 1040 
1025 1030 
45 TAT ATC CTT CGT GTC ACA GCA TAT AAA GAG GGA TAT GGA GAG GGC TGG _ 

Tyr He Leu Arg Val Thr Ala Tyr Lys Glu faiy Y iQ55 

1045 lut>u 
GTA ACC ATC CAT GAG ATC GAA GAC AAT ACA GAC GAA CTG AAA TTC AQC 
50 Val Thr He His Glu He Glu Asp Asn Thr Asp fa y^ 

1060 1065 

s s a s s &s £ := s ss s s 

55 , 1075 loyu 

s s? s s s a ss s s K s s ss s s s 

1090 1095 — ' 

60 AAT CAA GGA TAT GAC GAA GCC TAT GGT AAT AAC CCT TCC GTA CCA GCT 3360 



2688 



273:6 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 
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Asn Gin Gly Tyr Asp Glu Ala Tyr Gly Asn Asn^Pro Ser V.l Pro |1^ 

1110 ll-is. 



1105 



GAT TAC GOT TCA GTC TAT GAA GAA AAA TCG TAT ^ ^ <^<^ ^ ; , 
Asp Tyr Ala. Ser Val Tyr Glu Glu Lys Ser^Tyr Thr ^ r y : :^^p^, 

GAGAAT OCT TGT GAA TOT AAC ? A /GGC TAT GGC J GAT Thr" Pr o Leu 

Glu Asn Pro Cys Glu Ser, Asn Arg Gly Tyr Gly Asp.Tyr 

114% 1145 ■ .. M ■ ' ; ••.--•;> ki-. j,.-'A,i 

CCG GCT GGT TAT GTA ACA AAGGAT ^^^^^^^ Z 
Pro Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Pne 
1155 1160 

5 si s s s= s s a £ srt5*t s asp 

1170 1175 

AGC GTG GAA TTA CTC CTT .ATG GAG GAA. 
Ser Val Glu Leu Leu Leu Met Glu Glu 
1185 ll 90 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:30: 

Met Asp Asn Asn Pro Asn He Asn Glu Cys He Pro Tyr Asn Cys Leu 



Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg He Glu Thr Gly 

Ser Leu Thr Gin Phe Leu Leaser 
45 ... 

Gly Ala Gly Phe Val Leu Gly Leu Val Asp He II.- 
55 60 
Trp Gly He Phe Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie 

Glu Gin Leu lie Asn Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala 

85 90 
lie Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gin He Tyr Ala Glu 

100 105 
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 

Glu Met Arg He Gin Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala 



20 25 

Tyr Thr Pro He Asp He Ser Leu ^ 
35 , 40 

Glu Phe Val Pro Gly Ala Qixy rne y«* — — --- 
50 55 
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1 



130 135 ... 140 



He Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Val 
145 150 15 

Tyr Val Gin Ala; Ala Asn Leu His -L.u Ser Val Leu Arg .;As R Val Ser 

165 : ; v s : . , : * -. • ■ . 

Val Phe Gly Gin Arg Trp Gly Phe. Asp Al. Ala Thf He *sn Se^,, 

180 : 185 . / . ; < / ^ v ... 

Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn Tyr Thr Asp Tyr Ala Val 

Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro A S p Ser Arg. 

210 215 
ASP Trp Val Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 ^® 

L eu Asp He Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro 
245 ZDU 

lie Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val 

260 ^ bb 

r Qor . p>ve Ara civ Ser Ala Gin Gly He Glu 
Leu Glu Asn Phe Asp Gly Ser Phe Arg oiy 



27 5 



280 



A r g ser He «, Ser Pro His L. Met Asp He «u Asn Ser He T„r 

290 2 " 
He Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gin 
305 310 



Xle Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro 

325 330 
L eu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gin Gin Arg lie Val Ala 

3 40 34b 
Gin Leu Gly Gin Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr_Arg 

340 34b 
Arg Pro Phe Asn He Gly He Asn Asn Gin Gin Leu Ser- Val Leu Asp 

370 375 
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 

Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu He Pro Pro Gin 

405 41U 
Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 



420 



425 



Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser 



435 



440 



Val Ser He He 
445 



Phe Ser Trp He His Arg Ser Ala Thr Leu Thr Asn 



Arg Ala Pro Met Phe ber irp — — ~* 46Q 

450 45b 
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<g n. «p ft. 8i. « »« «* Ile I?? ^' Val lyS ' ■ 

Z vai Trp «y «v - - «' -» m :!!y "° ^ 

485 ■ :• ; " : ■. . ■ 

' Rcr , Thr Phe Gly Asp Phe' Val §er ; E,eu Gin. 
Gly Asp He Leu Arg Arg Asn Thr Phe Giy P . , • ., 5*0, 

val M „ ne » «« « - - ^ 

Val He Val Leu Thr Gly Ala Ala 



515 520 



Tyr Ala Ser Ser Arg Asp Ala Arg ^ 
530 



ser ^ „ V,! «, f V Oln V,! Ser V,! Asn Met Pro Leu «- JJJ 

it >i c 550 
545 

Thr Met «. n. «g «. M- «- «■* tr„ Arg Thi phe At9 s? 

„p Phe ser Asn Pro Phe Ser Phe Ar ? Ale Asn Pro Asp Jg W 

580 bbD 
Ile ser Olu am Pro Leu Phe Cly Ale «y Ser He Ser Ser Oly Olu 
595 bUU 

... rlu Ile Ile Leu Ala Asp Ala Thr Phe Glu 
Leu Tyr He Asp Lys He Glu lie ne ^ 

610 615 
J!. Oiu S.r Asp Leu Olu «. Al. «• & ^ ~ «° 

Tr S.r Ser Asn Oln He Oly Leu Lys Thr Asp « Thr Asp Tyr His 
645 

_ Te> „ Val Asp C ys Leu Ser Asp Glu Phe Cys 
lie Asp Gin Val Ser Asn Leu Val Asp cy g70 

660 bto 
Le „ *sp Olu Lys Ar, OXU Lee Ser 01. Lys Vol Lys His Ale Lys « 
675 680 



L eu ser Asp Oiu « Asn Leu « «• »* "° & ^ 
As „ « Oln Pro Asp «. Oly Trp Ar g Oly Ser Thr ASP He Thr -II. 



705 710 



Oi» Oly Oly Asp ASP val Phe Cys 01. « Tyr Val Thr Leu Pro Oly 



Thr v,l A,p Olu oys Tyr Pro Thr Tyr Leu Tyr Oln Lys II. Asp Oln 

740 /4D 

Thr Arg Tyr Glu Leu Arg Gly Tyr He Glu 

Ser Lys Leu Lys Ala Tyr Thr Arg ly i?65 

755 /bU 

, m. Tl<=> Ara Tyr Asn Ala Lys-His 

As p Ser Gin Asp Leu Glu lie Tyr Leu He Arg Ty^ 

770 
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Glu lie Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala 
785 790 • 795 . 

Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala |J| *IV . 

805 . . 8 i L0 .-, . -: ? 

Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys W*sp GlyJitrLysT ' 



820 



825 • 



Cys Ala His. His Ser His His Phe Thr Leu Asp -ni Asp,^ ^ C f . , 



835 



840 



Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val lie Phe Lys lie Lys 

850 855 860 

Thr Gin Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu 



865 



870 



Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys 



885 



Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin Leu Glu Thr Asn lie Val 



900 



Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gin 

915 920 925 

Tyr Asp Arg. Leu Gin Val Asp Thr Asn lie Ala Met lie His Ala Ala 



930 



ASP Lys Arg Val His Arg He Arg Glu Ala Tyr Leu Pro Glu Leu Ser 
Val He Pro Gly Val Asn Ala Ala He Phe Glu Glu Leu Glu Gly Arg 



965 



lie Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn Val lie Lys Asn 



980 



Gly Asp Phe Asn Asn Gly Leu Le^Cys Trp Asn Val Lys^Gly His Val 



995 



Asp Val Q Glu Glu Gin Asn AjnHi. Arg Ser Val LeuVal He Pre-Qlu 

Trp Glu Ala Glu Val Ser Gin Glu Val Arg Val Cys Pro Gly Arg Gly- 
1025 ' 1° 30 1035 

Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys 
1045 1050 

Val Thr lie His Glu He Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser 

1060 1065 1 

Asn Cys V.l Glu Glu Glu Val Ty^Pro Asn Asn Thr Val^Thr Cys Asn 

Asn Tyr Thr Gly Thr Gin Glu Glu Tyr Glu Gly ThrTyr Thr Ser Arg 

1090 1095 — " 

Asn Gin Gly Tyr Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala 
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1110 1115 ^ - ^ 1^20 

Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg^ 



1125 1130 



Glu Asn Pro Cys 

";' ii4Q 



61u Ser Asn Arg Gly Tyr &^M§^i 



1150 



Pro Ala Gly Tyr Val Thr Lys Asp .Leu Glu Tyr Phe Pro^ Thr ;Asp, 
10 . '! 115-5 . , ,1160 - • ■ :^; :; ' ( , ; V _, 

Ly s Val Trp lie Glu He Gly Glu Thr Glu Gly Th^Phe lie Val Asp 
1170 1175 

15 Ser Val Glu Leu Leu Leu Met Glu Glu 

1185 1190 
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All of the compositions and methods disclosed and claimed herein can be iftade 
and executed without undue experimentation in light of the preseht disclosure. While the 
compositions and methods of this invention have, befen d^scrib^d in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied 
to the compositions and methods and in the steps or hf the sequence of steps of the. 
method described herein without departing from the ebncept, Spirit and'rscope of the 
invention. More specifically, it will be apparent that certain agents which are both 
chemically and physiologically related may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substitutes and 
modifications apparent to those skilled in the art are deemed to be within the spirit, scope 
and concept of the invention as defined by the appended claims. 
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